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Forth computer 

With applications ranging from video games to research and process control this 
microcomputer combines the powers of Forth, a fast, threaded computer 
language/operating system, with an eight-bit processor having 16 bit internal architecture. 



Today a home or personal computer can 
more than match at lower cost the per- 
formance of a typical mid-1970s minicom- 
puter. Then a minicomputer costing tens 
of thousands of pounds would have a 
memory of less than 32K words with a 
cycle time of about a microsecond and a 
Teletype terminal capable of ten characters 
per second. Disc-drive memory was rare 
and expensive and double-precision/float- 
ing-point instructions would be executed 
by software. This microcomputer design 
costing a few hundred pounds has a 48K 
read/write memory operating at 666ns, 
further 8Kbyte rom containing the operat- 
ing system, a 100-character-per-second 
terminal and a 200Kbyte disc memory. 





Forth 

computer 


1970s 

minicom- 

puter 


Memory size 


56K (48K ram 




Memory 


8K rom) 


64K ram 


speed 

C.p.u. 16-bit 


666ns 


960ns 


add time 
Output 


4.6ps 


1.96 p.s 


peripherals composite 
video 
RS232 






8 ports 


64 ports 


Input 






peripherals 


parallel 


standard 




keyboard 
RS232 
8 ports 


peripherals 


Disc storage 


200Kbyte/ 






drive 


5Mbyte/drive 


Access time 


333ms 


35ms 


Cost 


£100-500 


£10,000- 

100,000 



The cost of developing control software 
and language application packages is the 
main reason why low-cost microprocessors 
have not destroyed the minicomputer in- 
dustry. It will be a long time before any 
microprocessor has the software support of 
the PDP11! Further, when designing a 
home computer from the i.cs upwards one 
does not have the support of other compu- 
ters to develop the software on and one 
cannot afford to develop the software 
alone. For these reasons the control 
program was chosen from those already 
available. This also applied to the choice of 
language; I was not willing to start from 
the bottom with machine code, for one 
sees too little reward for the effort of key- 
ing in programs on a hexadecimal keypad, 
nor was I prepared to design a ‘bootstrap’ 
rom that loaded the operating system in 
from disc, for I felt it an unnecessary com- 



mitment while the rest of the system was 
unproven. 

Language/operating-system 

choice 

The most popular operating system and 
language in the microcomputer field are 
CP/M and Basic respectively. Although 
Basic is readily available, in for example 
the INS8298 rom for the 8080, I was not 
prepared to use the language for reasons 
too manv to mention but summed up by 
Dijkstra who said “It is practically impos- 
sible to teach good programming to stu- 
dents that have had prior exposure to 
Basic.” He seems equally impressed by 
most other languages, including Fortran, 
PL/1, Cobol, APL and Ada. 

My first choice would have been Pascal 
but for this application Forth appeared to 
be the best choice. Besides being a lan- 
guage, Forth forms the basis of an operat- 
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ing system and the Forth Interest Group 2 
have made FIG Forth a public-domain 
product. The language is efficient, which 
is important when using a processor with a 
limited address range of 64K and it is 
interactive, avoiding the traps of edit-com- 
pile/load-run phases which are a left over 
of batch-processing systems. FIG Forth is 
a single-operator, single-task operating 
system but it has ‘hooks’ which allow it to 
be expanded into a multi-operator, multi- 
task system. It promotes good program- 
ming habits in that its programs are 
structured in blocks and work from top to 
bottom. 

The language has drawbacks - unfa- 
miliar notation, no file structure and poor 
data structures — but it is readily available 
and has more advantages than disadvan- 
tages. The power and flexibility in Forth 
allows the operator to expand the language 
and add any desired feature, and a new 
version of Forth may be placed on disc by 
editing and compiled using the resident 
language to give a completely user-defined 
version. 

Forth 

Details of Forth and how it operates are 
available (ref. 3) and the following is a 
brief summary. Forth uses 16bit arith- 
metic and reverse Polish notation, which 
implies the use of a data stack. Control 
between executable statements, referred to 



as a word, is accomplished by the use of 
indirect-threaded code and a control stack 
which is separate from the data stack. Fea- 
tures of Forth not found in Basic are vir- 
tual memory, compiling, extensibility and 
vocabularies. These features make better 
use of the processor resources and a 
program written in Forth will use less 
memory and run faster than its Basic equi- 
valent, often by a factor of ten or more in 
both cases. As Forth compiles the ‘Eng- 
lish’ program into a form readable by the 
processor (threaded code) the operating 
speed will always be faster than when 
using Basic which stores the program as 
text. Memory space taken up by compiled 
code is much smaller than would be taken 
up by its equivalent in English text so 
larger programs are possible in a limited 
memory space. 

The virtual-memory feature allows the 
programmer to treat disc storage as proces- 
sor memory so memory space is not 
limited by the processor but by the disc. 
Data is moved to and from the disc by the 
operating system so the programmer need 
not be concerned with the problem of 
mapping the disc memory. Vocabularies 
allow the programmer to keep different 
application programs in memory whicllare 
physically concurrent but logically 
separate. Further, there are features found 
in Forth that are not generally available in 
Basic such as recursion, extensibility and 
self-compiling. Recursion allows a portion 
of the code to use itself more than once at 
the same time and extensibility is the abil- 
ity of Forth to define new control words. 

Memory choice 

Before selecting a processor for the compu- 
ter, another design decision has to be 
made. This concerns memory, in particu- 
lar what type and how much to use. In 
time, memory will always become too 
small and too slow because of the pro- 
grammer’s rising expectations of what the 
computer should do 4 , so 4116 dynamic 
rams were chosen because they offer the 
best performance in terms of cost, size and 
power consumption when compared with 
static rams such as the 2114. This decision 
does present some problems in that refresh 
circuits and three-rail supplies are re 
quired. Because dynamic rams are prone 
to ‘soft’ errors, parity-checking circuits are 
included in the design. 

Processor choice 

The Z80 microprocessor contains dynamic 
ram refresh circuits and CP/M is written in 
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i/0 address 





Cl 10>J 25V Al +12V supply 
(4 places) 

^2 lOp 25V Al - 5V supply 
( 4 places] 

^3 100/j 25V Al +5V supply 
(2 places) 

C 4 [] lOOn between + 5V/gnd 
(8 places) 

x x 1 00 n between *12V/gnd 
and 10 Or between - SV/gnd 
(in 27 places) 

z Single in line 10k pullups 
(in 2 places) 



i.cs are numbered as 
follows, IC43 means 4th 
row down 3rd one in 
i.e. IC43 is an LS02 



16 d ip header with analogue components 




Complete Forth computer system has a 
48K memory, floppy disc storage and 
memory-data parity checking but the 
system may be used with 16K ram and 
without disc storage and parity checking to 
reduce costs. Wire wrapping allows the 
computer to be built on one relatively small 
board with a minimum of bus buffering. A 
further small board holds the disc 
controller and i/o-port hardware. The 
system can be set to read most disc 
formats. 



Z80 machine code which surely explains 
why it is the most widely sold processor, 
but to use Forth, the most suitable 8bit 
microprocessor is the 6809. Although most 
of Forth is written in Forth, the computer 
must execute some machine code to in- 
terpret the most primitive Forth instruc- 
tions. The 6809 has indexed addressing 
modes (see “6809 evaluation system” by 
R. Coates, Wireless World July 1980) which 
suit stack operations and as said earlier, 
Forth uses two stacks. These examples of 
stack addition illustrate the merits of the 
6809; they represent code of the Forth 
word ‘+’ for various processors. 



6809 




Z80/8085 




6800 




PULU 


D 


POP 


D 


PULB 




ADDD 


0,U 


POP 


H 


PULA 




STD 


o,u 


DAD 


D 


TSX 








PUSH 


H 


ADDB 


1,X 


6502 








ADCA 


0,X 


CLC 




8088 




STB 


1,X 


LDA 


o,x 


POP 


AX 


STA 


0,X 


ADC 


2,X 


POP 


BX 




STA 


2,X 


ADDAX3X 






LDA 


1,X 


PUSH 


AX 






ADC 


3,X 










STA 


3,X 










INX 












INX 













Secondly, the 6809 instruction set is parti- 
cularly suited to code the crucial Forth 
word ‘next’. The speed at which ‘next’ is 
executed determines the performance of 
the Forth system since this word controls 
the indirect-threaded code. ‘Next’ is called 
the inner (or address) interpeter to distin- 
guish it from Forth’s text interpreter 
which performs the function of a compiler. 




Having worked in Hewlett Packard's 
production and systems-engineering 
departments, Brian Woodroffe currently 
works with the company's South 
Queensferry research and development 
group and has recently been involved 
with designing the microprocessor 
control section of the HP3724/25/26A 
baseband analyser. Brian obtained a BA 
degree in engineering and economics at 
Downing College, Cambridge in 1970 
and an MA in 1975. His computing 
interests include real-time control, 
languages and microprocessor 
graphics but outside electronics, his 
main interest - rifle shooting, in which 
he has represented Scotland in full bore 
- has been curtailed through part-time 
studies for an M.Sc degree in computer 
systems engineering at Edinburgh 
University. 



Machine code in the computer emulates 
Forth operation, the Y register taking on 
the role of the Forth program counter, and 
the Forth instruction-fetch cycle is a ‘next’ 
machine-code routine. So you can see that 
the processor choice is dominated by the 
speed and memory cost of the ‘next’ opera- 
tion. Equivalent Forth ‘next’ operations 
for some microprocessors are listed below. 
Because the 6809 ‘next’ operation is so 
short, it may be copied in line as required 
resulting in improved performance 
through avoiding the JMP NEXT instruc- 
tion required for most processors. 



6809 


8088 


6502 


LDX 0,Y+ + 


JMP NEXT 


JMP NEXT 


JMP [0,X] 


LODS AX 


LDY #1 


(4.11) 


MOV BX,AX 


LDY [IP],Y 




MOV DX,BX 


STA W+l 


Z80/8085 


INC DX 


DEY 


JMP NEXT 


JMP WORD 


LDY (IP),Y 


LDAXB 


PTR (BX) 


STAW 


INXB 


(3.19) 


CLC 


MOVL,A 


6800 


LDA IP 


LDAXB 


JMP NEXT 


ADC #1 


INXB 


LDX IP 


STA IP' 


MOV H,A 


INX 


BCC$+4 


MOVE,M 


INX 


INC IP+1 


INX H 


STX IP 


JMP W-l 


MOVD,M 


LDX0,X 
STX W 


(1.25) 


XCHG 




PCHL 


LDX0.X 




(1.37) 


JMP 0,X 

(1) 
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Wait-state generator 




Values in parentheses are merit figures 
obtained by multiplying the number of 
processor cycles by the processor cycle 
time then dividing by the memory-access 
time in the processor cycle. It is interesting 
to note that the 6809 fares better than the 
mfiEe recently introduced 8088. This is 
especially so when one realises that the 
8088 has a 16bit arithmetic unit whereas 
the. 6809 in common with the other proces- 
-sbrs'noted has an Sbit arithmetic and logic 
unit (a.l.u.). 

Finally, the register set of the 6809 
exactly matches that which is required to 
operate Forth. 

6809 register Forth operation 

S system stack RP return stack 

pointer pointer 

U user stack SP data stack 

pointer pointer 

Y index register IP instruction 

pointer 

X index register W code field 

pointer 

D accumulator accumulator 

Peripheral devices 

Having chosen Forth and the processor to 
rim it on, other design requirements are 
easily determined. These were selected to 
maximize the number of peripheral de- 
vices that can be easily driven. First a 
floppy disc was included to provide a mod- 
est amount of non-volatile memory with 
mu.ch faster operation than tape recorders. 
Mini floppy discs were chosen for two 
reasons, firstly because they are cheap and 
secondly because the data rate of eight- 
inch double-density drives is too high for 
most microprocessors to handle without 
direct-memory access. Further, eight-inch 
drives normally require phase-locked loop 
clock-recovery circuits and also a mains 
supply. 

Three-inch disc drives from Sony were 
investigated but the data transfer rate is 



high so that only single-density recording 
could be used, which would mean wasting 
half of the data-storage capacity. Both 
these drives and eight-inch types can be 
used with the system, provided they run in 
single density. Processor memory in this 
system is greater than 40Kbyte so a disc 
capacity of greater than 400Kbyte is 
reasonable; one double-sided floppy-disc 
drive meets this requirement. It is interest- 
ing to note that the BBC Micro and Atom 
computer can only use single-density 5 l Ain 
disc drives because of data-rate problems. 

Different types of terminal are accom- 
modated. Operating-system words for ter- 
minals, KEY, TERMINAL and EMIT, 
are vectored so that they may be changed 
on-line between terminal types. At switch- 
on the system automatically sets vectors 
for the available terminals. These termi- 
nals are either serial RS232 or 8bit parallel 
for a keyboard such as the RCA VP60 1/6 1 1 
and integral video compatible with 625- 
line tv, displaying 1,024 characters in 16 
lines (the EF96364B controller may be 
used for 525 lines). The video section has 
its own memory, leaving 48K of memory 
free for other programs. Bit-mapped 
graphics video is best handled through a 
secondary processor connected to the user 
ports. A number of definable i/o ports are 



spare to allow for expansion of the system. 
Certain design features were included to 
reduce cost. By keeping the computer 
system down to one board, bus drivers 
necessary to overcome capacitance encoun- 
tered in larger systems are avoided. 
Another reason for avoiding these buffers 
is that they cause delays which eat into the 
access time available from communicating 
devices. The switch-mode power supply 
used means that a readily obtainable trans- 
former with a single secondary winding 
may be used to provide all three rails (+ 12, 
+5 and — 5V). TO? 

Next article describes computer circuitry. 
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continued from page 50 

Z8 Basic listing for eight-channel a to d 
converter (@ = byte, % s hexadecimal) 

1 PRINT "SILICONIX LD120/121ATO Z8 
INTERFACE" 

2 PRINT "HIT ANY KEY TO RUN";: GO 
@%61 ,%07:R= USR{%54) 



45 IN =1 : Z=%80 
50 @2=Z 

55 @3=%A0:03=00 
60 @2=%00 

70 W *W+1:IFW<15THEN70 
80 W =0:GO@%1400(m,c. routine for 
collection of data) 

85 IF N=1 THEN X=A: Y=B: GOTO 130 
90 IF N=2THEN X=C: Y=D: GOTO 130 
95 IF N=3THEN X=E: Y=F: GOTO 130 
100 IF N =4 THEN X=G; Y=H: GOT0 130 
105 IFN=5THENX=I: Y=J:GOTO130 
110 IF N=6THEN X=K: Y=L:GOTO130 
115 IF N =7 THEN X=M: Y=N: GOT0 130 
120 IF N =8 THEN X=0: ' Y=P: GOTO 130 
125 GO TO 45 
130 Z=Z+%10 

135 V=@%26+ @%25x 1 0+@%24x 100 
+ (a>%23x 1 000+(a>%22x 1 0000 



140 IF V>X THEN PRINT "CHANNEL"; 

N;"OVERRANGE";:GOTO 1000 
145 IFV<YTHEN PRINT "CHANNEL"; 

N;"UNDERRANGE";:GO TO 1000 
150 N-N+1: GOTO50 
1000 GO@%61,%07:PRINT@%22;” ."; 

@%24; @%25; @%26; 

1010 PR!NT“MAX”;X;"MIN”;Y 
1020 N=N+1 : GOTO50 

•Limits entered here for process monitoring. 



Machine code routine 



Line 


Assembler 


Hex 


200 


LD % F7, # % 41 


E6F7 41 


210 


LD % F6, # % OF 


E6 F7 OF 


220 


AND 3, #% 04 


56 03 04 


230 


JRZ, * 220 


6B FB 


240 


LD % 22, 2 


E402 22 


250 


AND 3 # % 04 


56 03 04 


260 


JR N2, * 250 


EB FB 


270 


CLR%21 


B0 21 


280 


AND 3 # % 08 


56 03 08 


290 


JRZ, *270 


6B FB 


300 


PUSH 2 


70 02 


310 


INC % 21 


2021 


320 


CP % 21, #4 


A6 21 04 


330 


JRZ 2, * 350 


6B 07 


340 


AND 3, #08 


56 03 08 


350 


JR N2 * 320 


EB FB 


360 


JR * 270 


8BEB 


370 


POP % 26 


50 26 


380 


POP % 25 


50 25 


390 


POP % 24 


50 24 


400 


POP % 23 


50 23 


410 


RET 


AF 
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Forth computer 

In describing memory and i/o interface circuits surrounding the 6809 microprocessor, 
Brian Woodroffe introduces more features of his FIG Forth computer in this second article. 



The system may be used in partial form. 
Operating-system and language software 
exist in eprom, so the computer will work 
without a floppy-disc drive. Many compu- 
ters use eprom as a bootstrap to load an 
operating system from disc, making a disc 
drive mandatory. Although omitting the 
disc drive reduces cost by almost half, 
virtual-memory features of Forth are lost, 
resulting in a significant degradation of 
performance. Fewer than one third of the 
memory devices are essential. Parity-error 
checking may be omitted. When the 
system is turned on, it only demands 16K 
of ram and as more is added the memory 
map is changed on line, Table 1 ( see over). 

Circuit description 

Memory. Eproms containing fixed in- 
structions of the Forth machine and 
M6809 peripherals pose few problems. 
These devices occupy the top 16K memory 
locations because the 6809 reset vector is in 
this area and decoding is simple using a 
dual two-to-four-line demultiplexer i.c. 
(LS139). Dynamic ram occupies the re- 
maining 48K addresses from 0000 to 
BFFF. Logic i.cs used to glue the main 
items together are low-power Schottky de- 
vices, chosen for their speed and low 
power consumption. Standard t.t.l. parts 
could be used, except in the timing chain 
for the dynamic rams and on the micropro- 
cessor memory and address buses; nmos 
microprocessor parts have very low driving 
capability and low-power Schottky inputs 
require less current than standard t.t.l. 

Dynamic rams consist of an X-Y matrix 
of capacitor storage cells. Access to a bit 
(storage cell) is gained by first addressing 
the matrix row. This address is clocked in 
by the falling edge of the row-address 
strobe (RAS) and data from all 128 cells in 
the row are transferred to row buffers. 
When the column-address strobe (CAS) is 
true, i.e. low, the column address on the 
address pins selects one of the row buffers, 
causing its data to be passed to the output 
pin. Timing constraints on these actions 
are fortunately not stringent relative to the 
time available in a processor cycle . 

Multiplexing of the 14 address lines onto 
the seven address pins is done with an 
13242 multiplexer. In this design, writing 
is carried out by the early- write cycle. 
Within the early-write cycle the write sig- 
nal is made true before the column-address 
strobe acts. When CAS becomes true, data 
on the data input overwrites that of the 
selected row buffer. Then when the 
address strobes become false, data from 
the row buffers are returned to their res- 
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pective cells, so writing the input data into 
the X-Y matrix. 

Two clocks, E and Q, divide the 6809 
processor cycle into four parts. The first 
quarter of the cycle is used to precharge 
the rams and as dynamic rams consume 
most power when the row-address strobe is 
applied, the selected bank of rams only 
receives this strobe on the rising edge of 
clock Q. The address multiplexer is then 
switched by a delayed Q-clock edge to 
apply column addresses, leaving sufficeint 
settling time before the E-clock acts. 

During a reading cycle the column- 
address strobe is made true half way 
through a cycle (rising edge of E Clock) so 
that data may be made available by the 
RAS-selected rams, through the LS245 
buffer, to the M6809 before its set-up 
time. All. of the rams receive CAS but only 
those receiving RAS pass data to the bus. 



Write cycle 




E 5V/div. 
RAS 2V/div. 



CAS 2V/div. 

R/W SV/div. 
200ns/div. 




Reod cycle 




During a writing cycle data is not made 
available by the M6809 until the second 
half of the cycle so CAS is delayed until the 
falling edge of the Q signal. 

Refresh generator 

Storage cells in dynamic rams, being capa- 
citors, lose their charge so they must be 
‘refreshed’. Any memory action refreshes 
the selected row through data being read 
into the refresh buffer and returned at the 
end of the cycle. Unfortunately, program 
flow will not normally refresh all the ram 
rows in the allotted time of 2ms and a 
refresh generator is required. 

There are three ways of refreshing rams. 
In burst refresh, normal processor action is 
suspended and the refresh generator cycles 
through all 128 rows (for a 16K ram) and 
returns control to the processor for the 
remainder of the 2ms. This results in the 
processor stopping for 128 memory cycles 
(85us at the clock speed used). Such a time 
lapse is unacceptable in this application for 
the disc drive can require communication 
with the microprocessor once every 32ps 
during sector read/write operations. 

So, distributed refreshing is required, 
that is, each successive row is refreshed at 
Ups intervals. Distributed refresh genera- 
tors demand that the processor does not 
have access to memory while the row is 
refreshed. The processor may be stopped 
for this period but a more efficient method 
is to use a circuit that recognizes when the 
processor is- not using memory and 
performs what is called a distributed hid- 
den-refresh cycle. This method was cho- 
sen. 

The refresh generator divides time into 
14 cycle quantums using an LS 163 counter 
and generates a refresh-request signal once 
each period (Ups x 128 cycles = 1.7ms). 
By monitoring address lines A1445 during 
the first quarter cycle, the generator knows 
when the processor does not require access 
to memory. Having recognized this it gen- 
erates a refresh-request signal and the 
13242 multiplexer places the refresh 
address on the ram address lines and all 
row-address signals are set true for a quar- 
ter of a processor cycle. During the refresh 
cycle the column-address strobe is false to 
inhibit the rams. The address multiplexer 
advances for the next address and the gem* 
erator does not demand further refreshes 
since a flip-flop is set. 

It is unlikely that the M6809 will mak e 
14 consecutive memory cycles since all 
instructions except NOP, SEX and DAA 
provide non-memory cycles. Should this 
happen, the refresh flip-flop being reset 
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[A*- 

c.p.u. 1*6“ 
address |Ai — 

bus [a 0 - 

R/W- 



Cisplay code selector 



ASCII parallel keybo ard 
read requestllow trull 



Read acknowledge 

(low true) 



26 IC75 . 

EF9364 

^ Display processor 



Keyboard data— 
lines _ 



, Kl5|3|6|7|l0|l 112 
2 B C A D P T CLR QD 9 
— CK 10 — 

^7 IC 0i> LS161 . 5 



Keyboard connections to i/o data bus 



8 bit parallel i/o bus 

tor user ports and keyboard and disc 



To disc drive interface 



p.I.a. D'6 _ 

To floppy disc circuit 



p i ct. D5 
Reserved n.c. 



p.i.a. AD4 
«= i/o strobe 



v*8 read strobes 
u=S write strobes 
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Table 1 . Example of how the memory map may be changed when 
more than 16K of ram is used. 



FORTH HEX 

SMAX DUP @ 4000 + SWAP ! 

SO DUP @ 4000+ SWAP ! 

SP! SMAX DUP.@ 4000 - SWAP 
R0 DUP (a, 4000 + SWAP! 

TIB DUP @4000 + SWAP ! 

7 FIRST @ DUP @ 4000 + SWAP ! 
' LIMIT @ DUP @ 4000 + SWAP ! 
FIRST DUP PREV! USE! 

DPMAX OUP @ 4000 + SWAP ! 
DECIMAL 



( allow more data stack! 

( move data stack) 

( reset data stack) 

( move return stack) 

( move terminal input buffer) 

{ move Forth virtual memory buffers) 

(IE 'FIRST' and 'LIMIT') 

{ point virt. memory pointers to virt. memory) 
{ move limit of dictionary up) 

( return to decimal arithmetic) 



and the counter carry being set (refresh 
quantum finished), processor action is sus- 
pended by a dummy direct-memory-access 
cycle which guarantees a non-memory- 
access cycle. 



Parity checking 

Capacitance used to store data in dynamic 
rams is so small that naturally occurring 
charged particles (alpha particles) have a 
charge great enough to corrupt data should 
they hit a ceil. Improved coatings on dy- 
namic-ram dies have reduced this effect to 
give an error rate below 0.1%/1000h for 
16K dynamic memories 5 . It is impractical 
to include error correction in small 8bit 
memories but parity checking to halt the 
processor when an error occurs is not. 

An odd-parity bit, generated by an 
LS280 parity checker when a byte is 
written into memory, is stored with the 
_ other eight bits. During the write-cycle the 
parity-ram data output is in its high-im- 
pedance state and the floating EO input is 
high. The parity device output is clocked 
into the ram input and correct parity is 
looked for when memory is read. On read- 
ing, the data output drives the parity 
checker and the error signal is passed to 
the error latch with the row-address strobe 
signals. If an error exists, the RAS line 
concerned is latched, a led indicates which 
memory bank contains the error, and the 
processor halts. 

Memory speed and drive 

Input characteristics of dynamic ram are 
quite different from those of t.t.l. Ram 
inputs are capacitive, which especially 
affects signals common to many inputs like 
RAS, CAS and WE, and they require little 
direct current. When driven directly from 
low-power Schottky t.t.l. these inputs can 
cause considerable overshoot that can re- 
sult in exceeding device specifications and 
longer access times through the time taken 



for the voltages to level out. 

To reduce ringing, some form af match- 
ing is required. Series matching is most 
appropriate since it does not increase static 
loading. The ideal driver would produce a 
slightly under-damped response but be- 
cause t.t.l. drive characteristics are asym- 
metric a compromise had to be made in the 
resistance value. Control signals are driven 
from LS37 clock drivers to ensure ade- 
quate drive toward the 5V rail. Resistance 
values are not critical for this relatively 
slow memory’ and the original even worked 
faultlessly with no damping resistors and 
standard LS00 drive. 

On analysing the timing requirement of 
the ram/M6809 interface I noticed that the 
most readily available 200ns rams leave a 
lot of spare time - so much so that these 
devices could theoretically be run with a 
666ns cycle time instead of the standard 
l^is. This was, of course, tried. Not only 
was it tried with the faster M6809A proces- 
sor but also with the standard device. In 
both cases functioning was faultless. This 
is not to say that all 1MHz parts will run at 
higher speeds but certainly 200ns access 
time rams will work at 1.5MHz. So for the 
cost of a new crystal the through-put of the 
system was unproved by 50%. 

Peripherals 

To ensure that 1MHz peripheral devices 
such as the 6821 peripheral-interface 
adapter and the 6850 communication-in- 
terface adapter operate correctly, the 
memory-ready signal (MRDY) is used. 
Whenever peripherals are addressed 
MRDY is held false by an LS122 monosta- 
ble multivibrator which extends the 
memory-access time. An M6850 commu- 
nication device forms the RS232 interface 
and the clock frequency for it is crystal 
derived. Currently the 1.5MHz c.p.u. 
clock only allows 1800bit/s and an external 
baud generator is an attractive proposition. 
Both -5 and +12V supplies are used for 



continued from page 48 
indebted to Keith Frewin, who wrote the 
SOFTBOX software, for providing roms 
385 and 386. 
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the RS232 interface. Current from the 
-5V supply is so low that the RS232 
driver has an active current limiter; the 
+ 12V drive is resistive. 

Many of you wil) not have an RS232 
terminal and will wish to use a separate 
keyboard and domestic tv. The keyboard 
interface will accept any 7bit parallel input 
signal with active-low most-significant-bit 
and active-low-going strobe and request 
signals. Two spare hand-shake lines on the 
p.i.a. and an output port could form a 
Centronics-type primer port. 

An EF69364A video i.c. provides timing 
signals necessary for a 625-line tv; a 
96364B device will provide signals timed 
for 525-line tv. Control code for the video 
i.c. is supplied through an LS157 quad 
two-to-one-line multiplexer and for normal 
display characters (p.i.a. B Dy=0) a fixed 
control code is set. When control charac- 
ters (hexadecimal 0 to F) are used the 
p.i.a. supplies the relevant code through 
the multiplexer (p.i.a. B D7=l) to the 
EF69364. As the c.r.t. gun scans the 
screen, the EF69364 selects the character 
to be displayed from the display ram and 
latches it into an LS273. 

The video i.c. was designed for use with 
ram that has separate data input and out- 
put lines (2101 ram) so the circuit was 
modified to allow 2114 rams with common 
i/o to be used. Character-code from LS273 
and row information from 69364 is 
supplied as an address to a character rom 
(a specially programmed 2716 eprom). 
Each character position is allocated a 7- 
wide-by- 12-high character block. 

Referring to last month’s article, the 
signal name at pin 6 of IC41 is active low 
and should read R, as should the signal 
name at the junction of IC47 pin 2 and IC45 
pin 3. On page 57, pins 13, 12 and 5 of the 
LS175 should be labelled Y 0 , Y x and Y 2 
respectively. 

A set of three programmed roms is avail- 
able from Brian Woodroffe at 632 
Queensferry Road, Edinburgh for £23.50 
inclusive. Technomatic (see advertisers’ 
index) will supply all i.cs mentioned in this 
articles 

Diskdrive interfacing is described in the 
next article. 
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Forth computer 

Interface circuits and software for disc-drive control are main subjects of Brian 
Woodroffe's third article describing his 6809-based microcomputer. First, operation of the 
video controller is concluded and i/o software discussed. 



Character-code and row information for 
the video-controller i.c. is supplied as an 
address to a character rom. Character in- 
formation for each row is fed to an LS165 
shift register and serial output from this 
register is combined with synchronization 
signals in an analogue gate to give a stan- 
dard IV p-p composite-video signal which 
is subsequently fed to a u.h.f. modulator. 

The dot clock, consisting of a Schmitt- 
trigger relaxation oscillator, should be ad- 
justed to the minimum frequency to 
minimize the luminance bandwidth re- 
quired in the monitor consistent with all 
text displayed on the screen. Character 
values 10 to IF hexadecimal are pro- 
grammed into the character rom to give 
coarse graphics. Two 2114 rams hold 
■ enough information for one 1024-character 
Forth screen to be displayed. 

Two further video rams store text nor- 
mally lost at the top of the screen. A switch 
allows a page of lost text to be displayed. 

Terminal and i/o software 

" The Forth reset routine checks to see if 
there is an M6850 present and if not auto- 
matically redirects terminal i/o routines 
from the RS232 interface to the p.i.a. for 
parallel i/o. Forth words giving access to 
user ports are included in this operating 
system. These words, P@and P! act in the 
same way as Forth words @ and ! except 
that they allow access to user i/o pons. 

The software-driven output word, PI, 
makes data available on the p.i.a. B lines 
then activates the address coded on the A 
lines. On-input, P@, reads data while the 
port address is made. Output ports ideally 
connect to LS273 latches and input ports 
to LS244 buffers. Port-strobe lines are de- 
coded from the p.i.a. A lines using LS138 
three-to-eight-line decoders. Eight read 
and eight write ports can be connected to 
this hardware and if more ports are needed 
then a further 6821 p.i.a. could be connec- 
ted and mapped into the USER variable- 
address area. Cursor control codes, i.e. 
decimal codes for EMIT, are as follows. 

8 left (backspace) 

9 right (tab) 

10 down (line feed) 

11 up 

12 home and erase 

13 carriage return 

14 home 

15 carriage return and line erase 

Disc interface hardware 

Interfacing to the floppy disc 6 is done 
using the most readily available controller 

The author is with Hewlett Packard in research 
and development. 
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since it is cheaper than using s.s.i./m.s.i. 
devices. Complexity of the WD1793 
controller is comparable to that of the 
6809. The first problem was interfacing an 
8080 style peripheral to the M6809 bus, 
the main difficulty being the writing data- 
hold times. 

The problem of data-hold times was 
■ solved using the memory-ready signal, 
MRDY, which when active (low) holds the 
processor clock cycles in an E-not-Q state 
for at least one quarter of a bus cycle. This 
quarter cycle provides the hold time. The 
memory-ready signal triggers a monostable 
multivibrator each time the procesor wants 
access to peripheral-drive address space 
between C000 and DFFF on the rising 
edge of the Q clock and this signal forms 
the floppy-disc controller write signal. 

A read signal is derived from clocks E 
and Q. Interrupt and data-request outputs 
of the floppy-disc controller are connected 
to the processor FIRQ pin so that data 
transfer can take place using the M6809 
SYNC instruction. As noted before, a 
floppy-disc drive’s data rate can cause 
problems when d.m.a. is not used. In 
double-density recording on a 5 l Ain floppy 
using a WD1793 controller, the worst-case 
data-transfer rate is 27(is/byte. Coding is 
shown in Table 1. 

The trick is that SYNC stops the 
M6809’s execution without affecting the 
clocks until the floppy-disc controller in- 
terrupt occurs and the processor resumes 
execution. This provides quick synchroni- 
zation between the processor and control- 
ler. Despite that modifying the direct-page 
register gives quicker access to the f.d.c. 
which is in high memory, this feature was 
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not used because of the extra coding 
needed. Had the processor clock been 
slower this alternative might have been 
necessary. 

Interfacing the floppy-disc controller to 
the drive is the next problem. Most of this 
is covered in an ANSI standard 7 but the 
problem of clock recovery remains. Be- 
cause of mechanical constraints, data read 
from disc will not be synchronous with any 
processor clock so clock information con- 
tained in the data stream must be 
extracted. In single-density recordings 
each bit cell has a clock bit and a possible 
data bit (no data bit is zero) and in double- 
density recording the position of the bit 
within the cell determines whether it is a 
one or a zero. A clock synchronous with 
incoming data is required to determine the 
incoming bit’s position. 

Although it gives the best performance, 
a phase-locked loop circuit was rejected on 
grounds of cost. Instead a crystal clock 
running at eight times the nominal read 
clock is used and a divide-by-eight version 
of this clock is phased with the incoming 
data to recover the original clock. First the 
incoming bit stream is synchronized to the 
crystal clock (x8) to produce pulses with 
accurately defined widths using an LS74. 
This pulse stream is fed to the floppy-disc 
controller (RAW READ). 

The reading clock is provided by an 
LS161 counter which is normally held off 
until the controller wants to read the disc, 
when the counter is enabled by the read- 
gate signal. This counter would normally 
free run at about the nominal clock rate, 
but it is synchronized by applying the raw 
read signal to its load input. The load 
frequency locks its D output (READ 
CLOCK) so that it changes mid-way be- 
tween input bits. As the maximum 
number of bit cells without read bits is 
three, the recovered clock never gets too 
far out of phase. 



send command byte to f.d.c. 

wait for f.d.c. response 

get status 

test byte-in 

no, then error 

get byte 

store, advance pointer 

reduce count 

loop back 

waittill 

f.d.c. finishes 



32 cycles at 1 .5MHz = 22ns 
Upon entry B=command code 

Y=pointertodata destination 
X=byte counter 



Table 1 . Code showing how the M6809 SYNC instruction is used for 
floppy-disc drive data transfer. 
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Problems with phasing are most notice- 
able when double-density recording is 
used, so a means of preventing bunching 
of the bits is used. Precompensatiotr 
prevents bunching by moving the written 
data bits slightly relative to the nominal 
position in a bit cell so that when the data 
is read back the bits appear to be in their 
correct positions. The matter of precom- 
pensation depends on the drive used. For 
those drives that do not require precom- 
pensation, including the TEAC FD50A 
used in the original design, the precom- 
pensation circuit is omitted. 

The disc should be set to respond to its 
address and head-load on drive select and 
not to the motor-on signal, i.e. the TEAC 
FD50 disc drive should be set as follows 
(for further drives, follow the same pat- 
tern). 

DRIVE 0 

HS=set, MX=set, DSO set, 

DS 1 ,DS2,DS3 = unset, 

HM = disconnected . 

DRIVE 1 (if fitted) 

HS=set, MX=set, DS1 = set, 
DSO,DS2,DS3= unset, 

HM = disconnec ted . 



Disc-interface software 

Under command of the c.p.u., the floppy- 
disc controller takes care of head posi- 
tioning, sector positioning, data serializa- 
tion and cyclic-redundancy checking. As 
soft sectoring is used, sector positioning is 
determined by the address record read 
from the formatted disc. The controller 
may be programmed to format the disc. So 
long as certain inter -record gap and record 
sizes are adhered to, the formatted disc 
capacity may be increased, Table 2. 

Different systems use different sector 
formats 9 , numbers and sizes of sectors and 
sector numbering systems. In this system, 
all variables associated with disc format- 
ting are defined by the user which means 
that most disc formats may be read. The 
sector size is written into the address 
record of each sector so it is possible for 
the system to adjust its buffer size to that 
of the disc. Forth word ?DISC is included 
to read the current disc and set parameters 
termed DENSITY, B/BUFand SEC/TRK 
to those associated with the disc. Only 
formats mentioned in Table 3 apply to the 
disc format program and ?DISC. 

When formatting a disc, it can be advan- 
tageous to interleave the sectors on a track. 
With this in mind a dummy word SKEW 
was included which is currently defined as 
no operation, but it may be redefined to 
perform an interleaving algorithm during 
formatting. Table 3. Defining Forth word 
FORMAT for disc formatting is shown in 
Table 4. 

Forth treats all disc memory systems in 
the same way, i.e. as a contiguous set of 
)D24byte screens, hence the choice of a 
v.d.u. Main Forth words used to gain 
access to screens on a disc are R/W , which 
moves data between a disc and memory, 
and BLOCK. As disc sector size depends 
on format, words BLOCK and constants 
B/BUF, bytes-per-sector, SEC/TRK, sec- 
tors-per-track, TRK/SIDE, tracks-per- 



Table 2. Capacity of a formatted disc 
may be increased provided that cer- 
tain record sizes and gaps are not ex- 
ceeded. 



Density 


Single 


Double 


Bytes/sector 


128 


256 


256 


512 


Sectors/track 


16 


10 


16 


10 


Bytes/track 


2048 


2560 


4096 


5120 


Bytes/disc 


82K 


102K 


160K 


205K 


Relative 


100% 


125% 


200% 


250% 



side and SIDE/DISC provide a means for 
Forth to work out which sectors make up a 
screen. The size of virtual memory buffers 
in Forth should be the same size as a 
sector. 

Time taken for the head to position itself 
over the relevant track is a major con- 
straint when using disc drives. Other time 
factors for a 5V4in floppy-disc drive are 
motor start-up time, head-load time and 
rotational latency. To speed up access time 
for double-sided discs it is usual to physi- 
cally combine two tracks on opposite sides 
of the disc into one logical track. This 
minimizes head seek time for it is likely 
that the sector required will be on the same 
bigger logical track and the time taken to 
gain access to the other side of the disc is 
governed by the time taken for an electri- 
cal switch to act rather than by the delay of 
a mechanical head seek. But since Forth 
treats all discs in the same way, including 
this feature would have meant that one 
could not mix single and double-sided 
discs. 

When using the Teac FD50A disc drive, 
access time is dominated by the start-up 
time of Is. If faster disc drives are used, 
time constants may be changed (discussed 
in a following article). Start-up time and 
head-stepping rate constants are moved 
into ram from eprom by the Forth start-up 
word COLD and may be modified to suit 
faster drives. Forth constants normally 
hold the values of constant^ in the parame- 



ter-field address (p.f.a.) but as this system 
is rom based, modification of the constants 
would not be possible so they are coded 
with a new routine which stores the value 
in ram. This list shows how the constant 
DENSITY is altered from single to double 
density and gives other constants and their 
meanings. 

DENSITY =1 (double density, 0 for 
single density) 

B/BUF =512 (number of bytes per 
disc sector) 

SEC/TRK =16 (number of sectors per 
disc track) 

TRK/$IDE= (number of tracks on disc, 
normally 35-40 for a mini- 
floppy) 

SIDE/DISC= 1 (2 for double-sided) 
SEC-OFST =1 (for numbering sectors 1 
to n, 0 for numbering 0 to 

n-1) 

1 (value to store, returned after 

execution of DENSITY) 

' DENSITY (find DENSITY p.f.a. 
address) 

@ (p.f.a. in this special constant 

points to constant position) 

! (store I there) 

Power supply 

Only one 15V secondary winding is re- 
quired on the transformer to provide a 
low-current — 5V supply for biasing the 
dynamic rams, +12V for the rams and 
floppy-disc drive and +5V for all logic 
circuits. A minimum value for the unregu- 
lated supply is determined by the 12V rail; 
unregulated input should be 20V to ensure 
adequate regulation with low mains sup- 
plies. Heaviest current demands are on the 
5V. supply and using a linear regulator to 
provide this rail would have resulted in 
excessive heat generation with a loss of 
efficiency so a switching regulator was 
designed. 



Table 3. Example of a routine for defining dummy word SKEW to give 
interleaved formatting. 



FORTH HEX 
: SKEW1 OOP 
1 AND IF 

SEC/TRK 2 / FE AND 
+ SEC/TRK MOD 

THEN ; 

' SKEW1 2 - 
' SKEW ! 


( select Forth and hexadecimal number base) 

( new word, duplicate sector # to be interleaved) 

( only even sectors are interleaved) 

( sector offset by half the disc) 

( add offset and keep with 0 . . , n-1 sectors on 
track) 

( find c.f.a. of new interleaving address) 

( find old skew p.f.a. and overwrite no-op there) 


Table 4. Routine for defining Forth word FORMAT for disc formatting. 


: FORMAT 

0 DR-SEL 1D0MS RATE CMND 
#SIDES 0 DO 
TRK/DISC 0 DO 
DP (S' 

1 J BLD-TRK WR-TRK 
track/side/status=" 1 . J . . CR 
1 STEP 
DP ! 

LOOP 

RATE CMND LOOP 
DE-SEL ; 

FORMAT 


( start compiling the word format) 

( turn disc drive on, seek track 0) 

( do for both sides) 

( do for all tracks) - _ ^ 

( save pointer to scratch area) 

( build up image of track, write It out) 

( inform user, 0= good status) 

( step in for next track) 
f recoverscratch area) ?= 

( for other side) 

( turn drive off, finish compilation) 

( carry out format) 



WIRELESS WORLD JULY 1983 



59 




(On processor board} 
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generator — K 11 60 A (Motorola) [7 i.o. bus (Density select) 




After bridge rectification and capacitive 
filtering, the 15V r.m.s. transformer out- 
put gives approximately 20V. Dynamic 
rams are sensitive to the sequence in which 
power is applied to them so the supply had 
to be designed so that — 5V appears first, 
followed by +5V then + 12V. 

Heart of the switch-mode power supply 
is a relaxation oscillator, the squarewave 
output of which feeds a charge pump to 
produce about —20V peak. This is regu- 
lated by a zener diode to produce — 5V. 
Reference for the +5V supply is a 10V 
zener diode connected in a feedback loop 
to maintain constant current even when 



Disc interface uses a readily available 
controller which works out cheaper than an 
equivalent circuit using s.s.i./m.s.i. devices. 
Clock information in data read from disc is 
synchronized using a crystal-controlled 
oscillator running at eight times the rate of 
the incoming-data clock. The prototype 
computer has a standard Teac 5 1/4in 
floppy-disc drive. 



Switch-mode power supply uses one 15V 
r.m.s. secondary winding for + 12V, -5V 
and high-current +5V rails. Frequency of 
the relaxation oscillator is 17kHz, giving the 
best compromise between smoothing 
component sizes and loss in efficiency due 
to switch transition times eating away at 
the duty cycle. Gating ensures that 
dynamic rams receive their three supply 
rails in the correct sequence ands.c.rs 
provide overvoltage protection. 



the 20V unregulated supply varies. An 
error signal derived from the + 10V refer- 
ence and +5V supply, and the relaxation 
oscillator triangle wave are fed to a com- 
parator. A portion of the triangle wave 
depending on the magnitude of the error 
signal is fed to the switching transistor. 
This pulse-width modulated base drive is 
disabled when the -5V supply is not 
present. 

The free-wheel diode, inductor and 
smoothing capacitor are fed by the 
switching transistor and are chosen with 
the operating frequency in mind. Around 
17kHz is used since it is the best compro- 
mise between high-frequency losses and 



component size. At low frequencies the 
smoothing capacitor and choke become too 
large and at high frequencies the switching 
transition time takes up a large portion of 
the cycle time and efficiency is reduced. 

Unregulated supply passes to the 12V 
monolithic regulator under control of a 
transistor switched by the +5V supply. To 
prevent overvoltage problems, an s.c.r. is 
included which switches on and blows the 
secondary winding fuse if either the +5 or 
4- 12 rails rise too high. 

To be continued with construction tips, parts 
list and vocabulary. 
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Forth computer 

Construction tips for the 6809-based Forth computer - part four. 



Most of the prototype version of this com- 
puter was constructed on one wire-wrap 
board. The number of signal buses ren- 
dered anything other than a multilayer 
printed circuit board an impractical solu- 
tion without splitting the circuit into sec- 
tions. Splitting the circuit was rejected to 
eliminate buffers associated with long 
cable runs. Wire wrapping provides 
connections at least as good as solder joints 
through cold welding between the wire 
and edges of the pin. 

All main memory, refresh circuit, 
microprocessor rom and interface i.cs are 
mounted on the main 229 by 178mm 
board, as are the video-display processor 
and memory. The analogue video gate and 
RS232 driver are built on two 16-pin dip 
headers. User-port hardware and the disc- 
drive interface between the floppy-disc 
controller and the drive are housed on a 
second wire-wrap board. There are many 
connections on the board so a powered 
wrapping tool, a stripping tool and dif- 
ferent coloured wires for different func- 
tions are useful. Copper-clad board was 
used for the power supply, which should 
be constructed before the main processor 
board. 

Dynamic ram takes little static current 
but substantial pulses, reaching toward 
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80mA per device over a few nanoseconds 
on some clock edges. Although the rams 
work within a 10% voltage tolerance, for 
reliable operation substantial local de- 
coupling must be included in the -I- 12 and 
-5V rails to overcome power-line induc- 
tance; each ram has a 0. lpF ceramic capa- 
citor on both supplies. Further lOpF bulk 
decoupling capacitors were used, one be- 




Voltage transients at the 41 16 dynamic 
rams showing from top to bottom the E 
clock signal and + 12V, +5Vand -5V 
supply lines with a 200ns/div timebase. 



tween each four devices. Decoupling capa- 
citors for the 5V rail were used throughout 
the design at the rate of one lOOnF compo- 
nent for each six i.cs. As with the RAS/ 
CAS/WE damping resistors, the design 
seems robust since the ram was initially 
built and worked without decoupling (see 
photograph). 

This is a large project and all construc- 
tion errors were found to be the result of 
either miswiring or plugging in the i.c.s 
wrongly. Dynamic rams I currently use 
got very hot when I plugged them in back- 
to-front. Construction should start with a 
minimum system, i.e. c.p.u., p.i.a., 
eproms and a 16K ram. At switch on, the 
lamp connected to the p.i.a. B-port Do line 
will go on then off. The state of this lamp 
then monitors the state of i/o data on the 
line. Ram-select lamps will stay off. V.d.u. 
hardware is self-contained so an idea of its 
performance can be seen on a tv screen 
without involving the main processor as 
the video i.c. generates its own characters. 

Connection of the parity circuit to 
HALT should only be made after the ram 
circuits are known to work, i.e. when the 
system ready message can be displayed 
consistently. Should the RS232 connection 
fail to work, the most likely cause, espe- 
cially if a signal at the a.c.i.a. output can 
be seen on resetting, is that data lines on 
pins two and three are crossed. Another 
problem could be that the RS232 terminal 



Main-board components 



Resistors 
Value Qty 

10k 8 



10k 

100 

500 

20k 

400 

33 

75 

150 

Ik 

2.3k 

4.7k 

2k 

2k 

5.1k 



2 

1 

1 

1 

4 

5 
1 
1 
5 
1 
1 
1 
1 
2 



Function 

pull-up, FIRQ, IRQ, NMI, VFOE, RESET, video 
and RS232 output 

pull-out parity, video ram, 9-resistor sii packs 

dot-clock 

dot-clock trimmer 

monostabletiming, 5% 

pull-up, led 

damping, RAS, CAS, R/W 

video output 

video output 

video ana RS232 output 

video output 

video output 

video output 

video output, trimmer 

RS232 output 



Capacitors 




Value 


Qty 


Function 


100p 


2 


+5V decoupling, 25V 


~20]i 


2 


+ 12V decoupling and reset, 25V 


10n 


8 


-5V and +12V decoupling, 25V 


lOOn 


57 


-5, +5 and +12V decoupling 


20p 


2 


crystal decoupling, 10% 


5 1 p 


1 


dot clock, 5% 


20p 


1 


monostable timing, 5% 



Integrated circuits 



Ref 


Qty 


Pins 


Type 


11 


1 


14 


LS280 


12-110 


9 


16 


4116 


21 


1 


28 


13242 


22-210 


9 


16 


4116 


31,67 


2 


20 


LS245 


32-310 


9 


16 


4116 


41,44 


.2 


14 


LS04 


42,47 


2 


14 


LS00 


43,72 


2 


12 


LS02 


45 


1 


16 


LS112 


46,53 


2 


16 


LS161 


47,48 


2 


14 


LS37 


51 


1 


40 


M6809A 


52 


1 


16 


LS139 


53 


1 


14 


LS122 


54 


1 


40 


WD1793 


55 


1 


40 


M6821 


62,63 


2 


24 


12732 


56 


1 


16 


LS175 


66 


1 


16 


LSI 57 


71 


1 


24 


M6850 


73 


1 


14 


LS86 


74 


1 


14 


LS132 


75 


1 


28 


EF96364 


76 


1 


20 


LS240 


77,78 


2 


18 


2114 


81 


1 


14 


LS00 


83 


1 


14 


LS04 


84 


1 


16 


LS161 


85 


1 


24 


12716 


86 


1 


20 


LS273 


95 


1 


16 


LS165 



See note for other i.c. locations 



Comments 

parity checker 
see note 

address multiplexer 
see note 

bi-directional buffer 

see note 

hex inverter 

quad 2-input NAND 

quad 2-input NOR 

dual JK bistable multivibrator 

sync, binary counter 

quad 2-input NAND clock driver 

microprocessor, 1.5MHz 

dual 2-to-4 decoder 

monostable multivibrator 

floppy-disc drive controller 

p.i.a. 

4K by eprom, T 8CC =450ns 
quad D bistable 
quad 2-to-1 line multiplexer 
a.c.i.a. 

quad 2-input ex-OR gate 
quad 2-input Nand, schmitt 
video display controller 
octal 3-state inverter 
IK by 4 static ram 
quad 2-input NAND 
quad 2-input NOR 
sync, binary counter 
2K by 8 eprom, T acc =450ns 
octal D bistable 
8-bit serial shift reg. 
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Other components 



Power supply 



2N2222 5 video, RS232 output transistors 
1N4150 2 video, RS232 output diodes 

2N2907 1 RS232 output transistor 

L.e.ds 4 parity checking, high-efficiency red 
6.00MHz crystal 
1.008MHz crystal 

DIP headers for video and RS232 output 
25-pin D-type connector for RS232 output 
Single-pole'two-way switch for display-page select 
Three, 16-way insulation-displacement connectors 
Vero 07-0130A wire-wrap board 

Wire-wrap pins (1 packet), wire, tool, un-wrap tool and wire 
stripper. Wire-wrap sockets: 



Pins 


Quantity 


14 


14 


16 


39 


18 


4 


20 


4 


24 


4 


28 


2 


40 


3 



Notes 

Memory circuit was designed using Mostek MK41 1 6-3 data 
sheet and most critical timing specification was T acc - 135ns 
(column-address strobe). Positions 105732,92 are 16-pin dil for 
plugs a,band c respectively. Positions 1691,93 are also 16-pin di! 
for RS232 and video signals. Resistors are 10% and capacitors 
are +80/-20% except where tolerances are given. 



MC3405 


op-amp/comparator, alternative 158 op-amp 
and 193 comparator 


LM7812 


12V, 1A regulator 


2N2222 


n-p-n (4 off) 


2N2907 


p-n-p (2 off) 


2N4036 


p-n-p (2 off) 


2 N 6476 


p-n-p (2 off) 


2N4443 


s.c.r. 


1N437 


ref. diode, alternative 1 N960B 9V zener 


1N4371 


zener, 2.7V 


1N4372 


zener, 3V, alternative 2.7V 


1N751 


zener, 5.1V 


IN 963 


zener, 12V 


MR852 


fast recovery diode 


MDA970-2 


bridge rectifier, 4A 


1N4150 


diode, alternative 30V switching diode, pref. 
Schottky 

high-efficiency red led, 2.2V drop 


HLMP-1300 


Capacitors 


In 


10% 


470n 


(2 off) 


lOOn 


(2 off) 


22p 


10V tantalum 


22u 


20V 


1m 


12V low equivalent series resistance, e.g. Spra- 
gue 672D046 or Dubilier UPC1052 
40V, alternatively 4m 


8m 

Resistors 



Disc interface 


Pins 


- 


0.13 

100 

133 


1W 
(2 off) 
0.25W 


Transformer is a 15V r.m.s. 2A 
type and should be protected by 
a 500mA slow fuse. A mounting 


Type 


Qty 


Comments 


200 


0.25W 


kit is required for the 2N6476, a 


LS244 


1 


20 


octal buffer 


680 


cooling tab for the T05 


•38 


2 


14 


standard t.t.l. quad NAND, o.c. 


Ik 


0.25W(6off) 


transistor, and the toroid is an 


LS123 


1 


16 


dual monostable multivibrator 


1.5k 




Arnold A-930157-2 with 35 turns 


LS161 


1 


16 


4-bit binary counter 


1.96k 


(2 off) 


of 21 s.w.g. (not 19 s.w.a. as on 


LS163 


1 


16 


4-bit binary counter 


3.16k 


the drawing). The toroid is 


LS74 


1 


14 


dual D bistable multivibrator 


10k 


(6 off) 


available from Walmore 


LS14 


1 


14 


hex inverter, schmitt 


28.7k 




Electronics Ltd, 11 Betterton 


LS04 


1 


14 


hex inverter 


75k 


(5 off) 


Street, Drury Lane, London 


K1160 


1 


14 


8MHz oscillator (Motorola) 


100k 


WC2H9BS. 


LS138 


2 


16 


3-to-8 line decoder 


50k 


preset pot. 



Other components 

Wire-wrap socket, 14 pin (4 off) 
Wire-wrap socket, 16 pin (10 off) 
Wire-wrap socket, 20 pin 
Wire-wrap board 176 by 1 10mm, 
e.g. Vero 02-01 20H 
34-way insulation-displacement 
connector 

34-way insulation-displacement cable 
to drive 

Disc drive, e.g. Teac FD50A (upto 4) 

Drive power connector (AMP1-480424-0) 
Pins for above connector (AMP6061 7-1 , 
60619-1, 4 off) 

Decoupling capacitors, lOOn (6 off) 
Decoupling capacitor, 1 00(x 
Input resistors, 333 (4 off) 
input resistors, 220 (4 off) 

Timing resistors, 30k (2 off) 

Timing capacitor, 2 m- 10V 
Timing capacitor, 33 m- 10V 



Alternative oscillator components 

Hex inverter, LS04 
Resistor, 464 (2 off) 

Capacitor, 20p 
Crystal, 8MHz 




takes too much current from the -5V 
supply, an indication being that the rams 
persistently give parity errors on power up 
which disappear when the RS232 terminal 
is disconnected. Forth response OK is 
preceded by the stack depth. 

The problem of driving capacitive loads 



with l.s.t.t.l. outputs showed up as un- 
dershoot in signals passing from the in- 
terface board to the controller. Although 
the prototype worked with the un- 
dershoot, it was cured by taking an 
inverted version of the required signal 
back to the main board and inverting it 



there with a spare l.s.t.t.l. gate. Capaci- 
tance of the insulation-displacement 
connection between the two boards was 
avoided in this way. Spare connections on 
the inter-board connector should be 
grounded and ground should be placed 
near active signals, e.g. clocks, disc data. 

Although for 8K of memory one gets a 
compiler and operating system and pro- 
gramming and execution unit there is still 
much to be done. I think that games are 
one of the best ways to learn about compu- 
ters for the definition of a problem to be 
solved is often as difficult as solving the 
problem. Forth is particularly suited to 
games programs — the Byte game contest 
was won by a game written in Forth 10 . 

Reference 

10. A. Saumon-Angus, Cosmic conquest. Byte, 
Dec. 1982, p.124. 

Further reading 

C. H. Ting, Systems Guide to Fig-Forth, 
Mountain View Press. 

Forth Dimensions, Forth Interest Group, 
PO Box 1 105, San Carlos, CA94070 (house 
magazine for members). 

Brian Woodroffe has found a way of speed- 
ing up disc operations and data-transfer 
rates so that faster units such as the Sony 
Microdrive and 8in drives can be used with 
the Forth computer. Descriptions will fol- 
low. ‘ TOtf 
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Forth computer 

Exceeding Brian Woodroffe's earlier expectations, his 6809-based Forth computer can be 
used with disc drives requiring high data-transfer rates - including Sony's microdrive 
and 8in floppy-disc drives. Access times for standard drives can also be reduced using a 

minor hardware modification. 



After using the computer for a while I 
became discontented with the disc system 
because the software caused a one second 
delay each time access to the disc was re- 
quired. This waiting time is needed to 
allow the drive to reach its operating speed 
— the software doesn’t know whether the 
disc is running or stopped before access — 
and much of the benefit of the virtual 
memory system is lost because of this de- 
lay. Forth keeps data from a number of 
disc sectors in memory. When data from a 
sector that is not in the main memory is 
requested by the program, Forth overw- 
rites one of the buffers with the required 
data. But if data in the buffer has been 
changed Forth first sends the data back to 
the disc so there can be two Is delays for 
one disc call. 

Keeping the disc constantly rotating is 
the easiest way of avoiding this delay but 
this was rejected because it shortens the 
life of the drive even though the motor is a 
brushless d.c. type. The method chosen 
relies on the fact that disc access operations 
are not uniform in time i.e., they are likely 
to come in bursts, especially when loading 
or listing screens from a disc and as a result 
of the virtual-memory buffer replacement 
algorithm described above. Keeping the 
drive running for a short while after a disc 
access is made means that it is likely that 
the disc will be running when the next disc 
access is required. 

Normally, the disc-drive motor is 
turned off by the disc-select signal going 
false (p.i.a. A port, D$, 0 to 1) when the 
program finishes using the drive. In the 
modification the drive motor enable signal 
is held true for five seconds after the drive- 
select signal goes false by a monostable 
multivibrator triggered by the trailing edge 
of the p.i.a. signal. As the software always 
assumes that the drive is up to speed and 
available, even though the monostable i.c. 
might have completed its cycle, a means of 
ensuring that the WD1793 controller 
doesn’t try to access the drive during the 
motor start period is required. During this 
period the ready signal is held false by a 
further monostable multivibrator fired by 
the drive-motor start signal. A low-pass 
filter after the NOR gate combines the two 
sources of motor-on signal to allow for the 
set-up time of the 5s monostable. 

This small hardware modification, con- 
sisting of two s.s.i. devices relieves the 
software of all considerations of motor- 
start latency. To prevent erroneous trig- 

Brian Woodroffe works in research and de- 
velopment at Hewlett Packard. 



bv B. Woodroffe 



gering the two monostable multivibrators 
should be grounded separately. 

Interfacing 8in drives 

I found it galling that my 1.5MHz 6809 
Forth system could not be interfaced with 
faster 8in drives, especially as these are 
often available second-hand at bargain 
prices. I have not yet got an 8in drive but I 
have been fortunate enough to try one of 
the sub-5in drives from Sony which has 
the same data rate as 8in drives. There is as 
yet no de jure standard for microfloppy- 
disc drives 1 but within Hewlett Packard, 
the Sony drive is the de facto standard. The 
first problem is to build a data-service rou- 
tine that services the disc at a rate better 
than llps/byte. Although nominal disc- 
data transfer normally takes 16ps/byte, 
Western Digital specify 1 1 and 13ps worst- 
case service times for write and read res- 
pectively. 

The previously used software loop 
(Wireless World , June 1983) achieves far 
worse than lips, even with the M6809 
direct-page register modified to make the 



Ust 1. in this design the following Forth 
words are available. 



controller i.c. accessible through direct 
addressing. Analysing the software loop 
shows that two functions are being carried 
out - a byte is transferred between the 
WD1793 controller and ram, and bytes are 
counted to determine when the sector 
operation is completed. If the second func- 
tion could be dispensed with the remaining 
loop would be much smaller and faster. 
There would be a small penalty in that the 
ability to read from and write to consec- 
utive sectors would be lost as no byte count 
is kept. The problem now is how to break 
out of the disc-service software loop. For- 
tunately the controller gives hardware help 
here in that the IRQ line is activated on 
completion of every command; the DRQ 
line is activated for each byte transfer. 
DRQ, connected to the M6809 FIRQ line, 
is used in the data transfer loop to make 
the processor clear its SYNC state, thus 
synchronizing controller/processor trans- 
fers operations. 

Interrupt request IRQ is tied to one of 
the other M6809 interrupt lines so that 
when a .read or write-sector command is 
complete the processor aborts its current 
data-transfer loop activity and commences 
the interrupt routine. On application of 
the FIRQ signal the processor does not 
abort the data transfer loop and carry out 
the FIRQ routine because the program 
inhibits the FIRQ interrupt by holding the 
FIRQ mask bit in the condition-code re- 



execute 

cfiir 



(builds does; 



CURRENT 



K [ 

TRAVI RSF 

I)A1>1 MO 
DLL I HAL 

PlU 

BLANKS 
P IND 



LOAD 
t ND If 
END 



U/HUF 

"FERS BLOCK 



fi/SCR 

DRIVE 

FLUSH 



DABS 

LEAVE 

SWAP 



’HAIRS 
SMUDGE 
-TRAILING 
JR TACK 



• LINE 
BACK 
‘LOOP 
ELSE 



(♦LOOP) 

(FIND) 

CMOVC 



HEX 

’FREE 

ERASE 

NUMBER 

T COMPILE 1 

DEFINITIONS 

iDR 

MESSAGE • 

BEGIN 

UNTIL 

UHILE 

D.R 

INDEX 

USEBLK 

;CDDE 

TASK 
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Disc interface modification speeds up 
overall access time by keeping the drive 
motor running for five seconds after the 
computer tells it to switch off. This 
significantly reduces the effect of a one 
second delay required for the motor to 
start up since disc-access operations tend 
to come in bursts. 



U.h.f. modulator connects to the video- 
controller circuit output (see June issue) so 
that the computer can be used with a 
standard tv set. ► 



gister set. The controller interrupt-request 
line IRQ, being connected to the proces- 
sor’s non-maskable interrupt input NMI, 
can never be masked so when this line is 
true the processor must be interrupted and 
goes to the routine requested by IRQ. The 
processor IRQ interrupt-request input 
could have been used but I wanted to leave 
it free for expansion. 

For sector read and write operations the 
disc controller interrupts on transfer of the 
last byte. In the case of a sector-write 
operation the data-transfer routine is fin- 
ished when the last byte is written into the 
controller. In sector read operations the 
data transfer routine is finished not when 
the last byte is read from the controller, 
but when it is written into the ram sector 
buffer. So when a sector is written the 



controller interrupts after all data transfers 
have taken place but when sectors are read 
the controller causes a jump out of the 
software loop before the last memory-sto- 
rage operation is carried out. Worse still, 
latency before the controller interrupt is 
variable so when an interruption is made, 
whether or not the memory has been up- 
dated remains in doubt. The solution I 
chose was to code the data-transfer loop so 
as to maximize the time between reading 
data from the controller and writing it into 



the memory. Inclusion of a no-operation, 
NOP, increases the data-transfer loop time 
to just under the allowable maximum of 
Bps to ensure that the controller always 
interrupts before the processor can write 
the byte into memory; the first operation 
of the interrupt routine is to write that byte 
into memory. Unfortunately this writing 
operation done outside the data-transfer 
loop means that the interrupt routines for 
sector reading and writing must be dif- 
ferent. 
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MEMORY SOURCE 




Nofes - By changing contents of location 'vector6‘ the write routine 
exits from its loop to location ‘WNMI* 

- Vector6 is in ram, all other locations are eprom 



The processor starts its interrupt sequ- 
ence by pushing appropriate registers onto 
the stack and jumping to code pointed to 
by a vector in high memory. In this Forth 
system high memory is eprom so the in- 
terrupt vectors cannot be changed to point 
to different routines. I remedied this by 
making the interrupt vectors point to code 
which executes a jump to a location deter- 
mined by a value stored in ram. By chang- 
ing data in- the ram location the non-mas- 
kable interrupt vector can be altered 
during program execution. This practice is 
unstructured and therefore unfashionable, 
but it is highly effective. 

Normally an interrupt routine is com- 
pleted by a return-from-interrupt instruc- 
tion which restores processor register 
values to those prior to the interrupt, i.e. 
restore context. In this case the interrupt 
vector is being used as a jump instruction 
to jump out of the data-transfer loop so the 
first operation in the NMI routine is to 



Diagram of program flow during a sector. 
read on the Forth computer. 
delete saved registers (LEAS 12,S). As the 
controller is connected to the non-maska- 
ble interrupt line there is the potential for 
the occurrence of an interrupt when one is 
not required. To prevent this the NMI 
vector points to a safe routine when not in 
use which reads the controller status regis- 
ter, clearing the cause of the interrupt be- 
fore carrying out a more normal return 
from interrupt, RTI, operation. 

Extra signals to the disc drives, e.g. 
track 42, should be inverted and buffered 
using standard t.t.l. open-collector drivers 
(7438) as used for the WGATE signal. 
Extra input signals from the drive such as 
READY should be buffered using say two 
LS14 gates, and terminated as previously 
shown. I should have used the drive’s 
ready signal, eliminating one of the monos- 
table multivibrators connected to the index 
line but I did not. Also I connected my 



motor-on signal (5.25in drive, pinl6) to 
the Sony drive head-load line, pin 14, 
whereas I should have used hold, HLD on 
pin 28 of the controller. These minor mod- 
ifications were made because I still intend 
to use 5.25in drives as they currently offer 
better value for money than the Sony 
drives at one-off prices. 

The matter of write-precompensation 
has not yet been resolved. I found by trial 
and error that for the Sony drive at least 
write-precompensation is not mandatory. 
It might be necessary for older 8in drives, 
and in commercial products to minimize 
the number of attempts to read the disc. 
Details of precompensation circuits are 
given in the Western Digital handbook 
and reference two. 

For drives that keep the disc rotating, 
such as Sony’s and most 8in drives, the 
disc speed-up hardware previously des- 
cribed should not be fitted but the drive 
should be connected with the motor-on 
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I List 2. Forth words specific to this system. 


Note the novenent of 




si s2 . . ABC . . 


r 1 r2 r3 


forth word 'ABC' v 


kns two values off the black <s2~top of block) anti produces 


three reoulto (r3= 


nu top of stack) . 


VEHIT,VKEY,V?TER 


- 0 liber variable containing the execution vector for 
EMIT , KEY, ^TERMINAL respctively 
. . VEHTT . . rfddr ..VKEY..addr . . V’f TER . . oddr 


DPNAX 


- a user variable containing the maximum address for the 
die t lunar y , 

. . DPMAX. . add.- 




- a user variable containing the nanmin depth of data 
stack allowed for this user 
. .SHAX. . addr 


P 1 ,PS 


- a UORD to -..tore a byte to, read a byte fron one of the 
user ports 


DENSITY 


- a constant -0 for single density; -1 for double density 
. . DENSITY. .boolean 


DR-SLt 


- a UORD to select the drive value top of v.tack 
value . . DR-SEL . . 


DE-SEL 


- a UORD to dealer! the disc drives. 
, . DE-SEL . . 


VERIFY 


- a variable which if true causes a read after write 
verify of disc writes 
. .VERIFY. . ad tit 


SIDE /DISC 


- a constant returning the nunher of sides per disc 
. .SIDE/DISC. .value 


SEC/TRK , TRK/SIDE 


-= constants returning the lumber of sectors pec track and 
tracks per disc respec t i vely . 

. . SEC/TRK .. value . .TRK/SIDE. .value 


SIDE 


- a UORD to select the side of the disc depending on the 
value at top of stac k 
value . . SIDE . . 


VUR1 IE 


- if tiisi is wr i le-pi o 1 t?c t ed issue error Message 10 and 
abort execution to return to terminal node 
. . TURITE . . 


OMNI) 


- a UORD to execute '•'•••'• ronnand at top of stack 
value . . CMND . . 


RATE 


- a CONSTANT which equals the disc drive stepping rate as 
coded for the UD1793 
. .RATE . . value 


SEEK 


steps the floppy drive lo seek the ti-at k that 'is al top 
of stack 
value . .SEEK. . 


STEP 


- a UORD to step- the fl.oppy disc, IN if TOS-'l 
OUT if T0S=-1 
sane direction if TOS^O 

value . -SEEK, . 


R -ADR 


a/ flag-true if on .uccessful (-statu-, of UD1793) 
li / set. t or_t ype, sector, side, track , false flay 
..R-ADR.. type sector side track 0 or 

. .R-ADR. . true 


BREAD., BURITE 


- sinalar to FORTH Bl.OCK.READ , BLOCK .UNITE except 
additional parameters are included, sector within a 

returns UD1793 status O-succesful 
address side sector ..BREAD., status 


SEC-R/U 


- cinilai ,t o MIRTH R/U extepi no linit < her king is done. 
Returns to i op of slaik a flag. U-successful operation 

Z - eoli err or 
3-noT ready 

addr set tor flag . .SEC-R/U.. .tarns 


7 l?K UR 


a UORD to write a whole track to disc, used in for Malting 

tnnage. Return WD1773 status, (1 successful 
addr . . IRK-UR . . sta lus 


TRi; -BED 


- a UORD to build tn humiii y a liyte iMage of a ir.uk prior 
to be written ou' bv TPK-UI’. lake-, a, input the side arid 
track nunber whose iMage is to be Turned anti produces 
the address of where the inane i,. 
side track . . IRK-BI D . . addr 


,„ K r 


- a UORD lo find out what sort of di-.r i on Urn drive, 
sots CONSTANT, B/BUr , DFNS1 ! Y , ST C/ IRK t u suit. 

. . ’DISC. . 




- a UORD to delav ext?< u t t on f.y the mmbei of Milliseconds 
that is top of stack 
value . .NS. . 




currently a no open a 1 1 on wind, that i aide- patched to 
allow inter Inau tug ( .J the set tors when foinattmg a dt.c, 
Intended usn,,ge i, tha< it will Modify Hie set.toi nunl.e: 
cor.-ently at tup of .tack. 



signal permanently true, i.e. grounded. 

Software issued (first revision) assumes 
the presence of disc speed-up hardware 
and includes the faster data-transfer loop. I 
will supply a drive pin connection list and 
format program for the Sony drive that can 
be modified for 8in drives to readers 
sending an s.a.e. to me at 632 Queensferry 
Road, Edinburgh. The Forth word BLD- 
TRK in eprom is only suitable for mini- 
floppy disc drives. 

Thanks to Hewlett Packard for the use 
of their test equipment and Sony for the 
loan of a microdrive. Software used, based 
on the FIG model, was prepared on an 
HP64000 microprocessor development 
system. 

Integrated circuits 87 and 88 were miss- 
ing from last month’s components list. 
They are 2114 static rams. In the photo- 



graph of power-supply spikes, vertical sen- 
sitivity for all but the clock signal is 

0. 5./div. 

References 

1. J. Bovin, Floppy incompatibility, Systems 
International, May 1983, p.61. 

2. J. Hoeppner and L. Wall, Encoding/decod- 
ing techniques double floppy disc capacity, 
Computer Design, Feb. 1980, pp. 127-135.^^7 

Brian Woodroffe plans to describe the Forth 
language in a subsequent series. 
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Complementary 

current mirror 



Current mirrors with transistors of the 
same type of conductance are well known 1 
and widely used in integrated circuits 2 . It 
is possible to create the configuration with 
similar properties using complementary 
transistors also, Fig. a. Accepting the 
usual assumptions 2 that 

Ici = Isiexp|VBEi|/VT 
Ic2= Is2®XpV BE2/V T 
Ibi=Ici/Pfi 

and Ib 2 “Is 2 ^Pf 29 the output current is 



( 1+ J_) J5! + L 
^ Pfi/ Is2 Pf2 

with |V BE] |=Vbe 2 . If the technology 
allows two complementary transistors with 
Isi = Is2s then 



IC2= 



1+ PF1 + f*F2 



1 H 1 Pfi (W 



as in the ordinary current mirror. Usually 
n-p-n and p-n-p transistors in an inte- 
grated technology are produced by dif- 
ferent methods and parameters Isi and Is 2 




Complementary transistor current mirror 
with matched transistor (a) and matched 
emitter resistances (b). 



are not matched. But the discrete current 
mirror with matched resistors in emitter 
circuits works reasonably well. Fig. b. In 
this circuit 



and 



IejR+ |Vbei I =V BE 2+Ie2R 



Iei=Ie2+ 



jVBEl|-VBE2 

R 



If transistors are designed for complemen- 
tary operation, say 2N4401 and 2N4403, 
and emitter resistors are matched to within 
1% the error is 2 to 3% without pre- 
liminary transistor matching. The gain (5pi 
of a discrete p-n-p transistor is usually high 
and the collector currents happen to be 
matched also. - I. M. Filanovsky, Uni- 
versity of Alberta. 

1. F.J. Lidgey. Looking into current mirrors. 
Wireless World, October, 1979, vol. 68, pp. 51- 
58. 

2. P. Gray, R. Meyer. Analysis and design of 
analog integrated circuits. Wiley, 1977. 7^7/7 
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Complementing his description of a 6809-based microcomputer, Brian Woodroffe details 
the language used - Forth - and why he chose it, in this second series. 



Forth is a language well suited to modern 
microprocessors and is widely used in such 
diverse applications as word processing,, 
data-base management, instrument and 
process control, video games and data ac- 
quisition. In a kernel of less than 10Kbyte 
the following features are provided 

— An interactive system. 

— A high-level compiler with all standard 
control features. 

— Fast execution, comparable with 
machine code because of the compiler. 

— The language system is largely 
processor independent; only around 20% 
of the code written in assembly language 
need be changed to suit the computer. 

— Virtual memory and application- 
oriented program modules. 

Further, the system may be readily ex- 
tended to suit new applications because the 
compiler can be modified by the user and 
new data structures introduced. These fea- 
tures are achieved by defining a virtual 
machine which is easily simulated by any 
target machine. Using ‘threaded code’, 
transferring control in the host from one 
virtual machine instruction to the next is 
quick and easy. Instructions of the virtual 
machine are used to build the monitor and 
compiler. Using the monitor the user may 
examine the effect of a series of Forth 
instructions and using the compiler this 
series may be added to the instruction set 
for future use. 

Background 

Forth is a computer language for fourth 
generation computers 1 . The language 
would have been called Fourth but six 
letters would not fit in the IBM1130 job- 
control language that its inventor, C. H. 
Moore, was then working with. Today 
Moore’s company Forth Inc. is foremost in 
marketing FORTH for many different ap- 
plications, besides the field of astronomy 
where it first found favour 2 . Other com- 
panies such as Miller Microcomputer Ser- 
vices and Laboratory Microsystems sell 
their own versions of Forth but the prime 
mover of Forth in the home-computer/ 
hobby field is the Forth Interest Group* 
(FIG). They have made versions of Forth 
available for many computers including 
the PDP-11 and for 8080/Z80, 6800, 
8086/8088 and 6502 processors. There are 
many versions of Forth and while all are 
similar no two are necessarily identical. 
For example, Poly Forth, FIG Forth and 
Forth 79 are all Forth but they are not the 
same. They differ primarily because of 
differences in the processor on which they 
run (16 or 8 bit memory, port or memory 
mapped Uo, etc.). FIG Forth will be used 
in all following examples. 



*Forth Interest Group, PO Box 1 105, San 
Carlos, CA94070, USA. 
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Forth is a collation of different sofware 
concepts forming a coherent whole. As an 
operating system, it is not as powerful as 
most but it takes care of all terminal and 
disc input and output. Small assembly-lan- 
guage routines must be supplied by the 
user to interface his hardware to the rele- 
vant system calls. It is also possible that 
memory-allocation changes may also have 
to be made. Most of Forth is written in 
Forth. It may seem strange that a language 
may be defined in terms of itself but one 
would use English words to explain the 
English language. Defining the language 
in this way means that programs may be 
transferred between different computers 
and implementations. There is a base in- 
struction set which must be written in the 
machine code of the host computer. This is 
the only machine code required and the 
process is known as simulating a virtual 
Forth machine. 

Most computer languages are programs 
which, recognizing statements in a source 
language, convert them into a target lan- 
guage. Usually the source language is text 
readable by humans in ASCII form and 
output is machine code of the computer. 
This is not always the case: cross compil- 
ing results in the target code being dif- 
ferent from the host computer machine 
code. More exceptionally there are cases 
where the machine code can only be exe- 
cuted by a hypothetical computer, an 
example being O-code for the language 
BCPL 3 and P-Code for certain implemen- 
tations of Pascal 4 . This is also the case for 
Forth and the virtual-machine execution 
mechanism will be explained first. 

Threaded code 

Explanation is simplified by visualizing a 
machine-code program for the processor 
concerned as a succession of subroutine 
calls. These calls transfer program control 
to each subroutine in turn. A stack, i.e., 
last-in-first-out list, would be the mechan- 
ism by which each subroutine returns 
control to the correct point in the m ain 
program. Knowing that the main program 
is solely a succession of calls it is now 
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possible to reduce the main program to a 
list of subroutine addresses by removing 
the subroutine op-code, and to have a 
special program known as an address in- 
terpreter to transfer control down the main 
program address list. This is called 
threaded code, for the main program is the 
thread into and out of which the address 
interpreter threads control 5 , List 1 . 

In List 1, letters A, B and C denote 
machine-code subroutines, ip is the 
threaded-code instruction pointer and 
parentheses indicate one level of indirec- 
tion. Threaded code trades the cost of the 
code for each call saved for address in- 
terpreter speed. In a long program the 
code cost of the address interpreter will be 
negligible. Further savings can be made by 
replacing the subroutine return statement 
by a jump to the address interpreter and 
changing the address interpreter as shown 
below. This releases the stack pointer used 
for subroutine calls and returns. It is im- 
portant that the instruction pointer can be 
speedily accessed, for example by keeping 
it in a processor register, so as not to slow 
down the address interpreter by causing 
unnecessary memory activity. 

If the lists are considered to be the ac- 
tions of a virtual machine then a software 
routine NEXT represents the hardware 
execution fetch of the virtual machine. In a 
threaded-code computer the time of in- 
terpreting these lists is dominated by the 
time of the NEXT operation so it is best to 
run threaded code on a computer that 
handles NEXT efficiently or to use 
microcode. 

Code routine including return 
A: xxx 

jmp NEXT 

New address interpreter 
NEXT: ip+1 ->ip 
jmp [ip] 

Indirect threaded code 

The next improvement is to allow called 
routines to be not just pure machine code 
but also address lists. This is done by having 
a special routine that knows that the fol- 
lowing data in the list are not code but 
addresses that must again be interpreted. 
Further, the routine must suspend interpre- 
tation of the main program while interpret- 
ing this new list of addresses. Return of 
control to the suspended list is done using a 
stack to save and restore the instruction 
pointer which is similar to the machine-code 
subroutine call/return operation. There 
must be an equivalent code routine to return 
control to the main list. 

Normal code routine 
A: machine code 

jmp NEXT 
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Threaded routine 
P: sp—1 — > sp 

ip - > [sp] (push current ip) 

#L-1 ->ip (start interpreting new 
list) 

jmp NEXT 

L: A (code routine) 

B 

C 

Return routine 

[sp] ”> ip (pop ip) 

sp+1 — > sp 

jmp NEXT 

As most routines are likely to be lists and 
not machine code this stacking method, 
similar to subroutine calling, will take a lot 
of code area. Considerable space would be 
saved if there was just one copy of this 
routine. The address interpreter would 
normally jump to this routine but it would 
also have to execute code routines. This is 
done by making the first element of each 
list a pointer to code rather than the code 
itself. In the case of lists the pointer points 
to the stacking operator but with code 
routines it points to the next code address. 

New address interpreter 
NEXT: ip+1 — >ip 
[ip] -> w 
jmp [w] 

Stacking operation 
DOCOL: sp—1 — > sp 
ip ->[sp] 
w+1 ->ip 
jmp NEXT 

Destacking operation 
SEMIS: [sp] — >ip 
sp+1 — > sp 
jmp NEXT 

Code routine 

A: $+1 (point to next location) 
xxx 

jmp NEXT 

List routine 

DOCOL 

P 

Q 

SEMIS 

This is the equivalent of machine-code 
subroutine call and return instructions. In 
Forth, the stacking and destacking opera- 
tions are called DOCOL and SEMIS res- 
pectively. At the beginning of each address 
list, the extra address introduces a level of 
indirection — this is indirect threaded 
code . In Forth the lists are divided into 
two parts, one being the code field which 
points to the address and the other known 
as the parameter field where the code is. 
These two parts and dictionary data, to be 
described, form a WORD. Code pointed 
to by the code field determines how the 
parameter field is interpreted. In the case 
of code words, the code field points to the 
parameter field. When the code field 
points to DOCOL, the parameter field is 
to be interpreted in a similar way to a 
subroutine. It is possible for the code field 
to point to some other routine which may 
make different use of the parameter field. 

1 wo examples of this in Forth are DO- 
CON and DOVAR. The former treats the 



74 



value in the parameter field as a constant 
and pushes it onto the data stack, to be 
described, whereas DOVAR pushes the 
address of the parameter field which is 
used as the storage location for that varia- 
ble. To enable these routines to access the 
parameter field a third register, known as 
V, is required. 

The address interpreter for indirect 
threaded code is more complicated than 
that for direct threaded code and so it is 
even more important to choose a processor 
with a suitable instruction set. Surpris- 
ingly for direct threaded code, NEXT can 
normally be coded using the processor 
subroutine-return op-code provided that 
the processor uses a stack that may be 
placed anywhere in memory. As the stack 
pointer is pointing to the thread, the 
processor must not receive interrupts for 
the status cannot be saved without des- 
troying the thread. NEXT for indirect 
code is more complicated as it involves an 



extra level of indirection. 

Choosing a processor, stacks and language- 
control structures are subjects of the next 
Forth language article. 

An i.c. in the Forth computer switch- 
mode power supply on page 61 of the July 
issue was incorrecdy designated the 
MC3045. The correct designation is 
MC3405. 

References 
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the computer will have a repertoire of 
instructions that it can execute. This is 
normally the machine/assembly level in- 
structions. However by running a 
program on this machine it can be made 
to took as though it has a different in- 
struction set. it is possible to time she/e 
the computer between two or more users 
so that they both think they have a 
separate computer. These techniques are 
known as creating a virtual machine. 

flp-oodo. Each different instruction Is en- 
coded into a unique symbol (usually bi- 
nary. known as an op-code. 

Host computer. The computer on which 
the program is currently executing. 
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Forth language 

Selecting a processor to suit the language, and control structures are subjects of Brian 
Woodroffe's second article illustrating why he designed his computer around Forth. 



Forth’s speed is directly related to how 
efficiently the computer can execute the 
NEXT operation. The Table shows how 
NEXT is coded for some popular eight-bit 
microprocessors; the 6809 processor exe- 
cutes the operation quickly so a NEXT 
operation may be included at the end of 
code routine. This improves performance 
since the ‘JMP NEXT’ operation needed 
for most processors is avoided — in stark 
contrast to conclusions drawn .from one 
manufacturer’s benchmark tests' . 

NEXT is the virtual-machine instruc- 
tion fetch so the choice of a processor to 
run Forth on should be dominated by 
speed and memory costs of the NEXT 
operation. Further, 6809 registers exactly 
match those required for Forth as can be 
seen in List 2. Machine code in the host 
computer represents the Forth machine, 
the Y register taking on the role of the 
Forth program counter. Following 
examples of simulating the virtual 
machine, in 6809 machine code, confirm 
that this processor is well suited to Forth.. 

The stack 

So far, only the control mechanism by 
which Forth transfers control from one 
word to the next has been described, but 
the language must also control and mani- 
pulate data. This, too, is done by means of 
a stack, but this storage area is known as a 
data stack, as opposed to the one 
previously described which is known as 
the ‘return’ or ‘control’ stack. Separation 
of the stacks simplifies things; normally, 
data and control operations use the same 
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stack. The stack is further broken down 
into ‘frames’ with markers to denote which 
part is what. In Forth all operators, such as 
the words + and AND, may remove in- 
structions from the stack, destroy . them, 
manipulate them and push results back 
onto the stack many times. This has the 
advantage that operators need not be told 
where their operands are, which results in 
less code. A computer operating this form 
of addressing is known as a zero-address 
List 2. Registers of the 6809 suit Forth 
requirements. 



6809 register 


Forth usage 


S 


stack pointer 


RP 


return stack 
pointer 


U 


user stack pointer SP 


data stack 
pointer 


Y 


index register 


IP 


instruction 

pointer 


X 


index register 


W 


current c.f.a. 


D 


accumulator 


- 


accumulator 



machine, for operand addresses are impli- 
cit in the instruction. These words may be 
in the machine code of the target computer 
or determined using words already de- 
fined. 

Using a stack avoids problems caused by 
parentheses and operator precedence. As 
far as the computer is concerned the prob- 
lem is solved, List 3, but programmers 
used to infix notation may find postfix 
notation (reverse-Polish notation) difficult, 



List 3. Some 6809-code arithmetic routines 

including add, subtract and two's 

complement. 

"+’• FDB S J -2 
PULU D 
ADDD 0,U 
STD 0,U 
NEXT 

MINUS FOB $+2 
LDD #0 
SUBD Q,U 
NEXT 

@ 'FOB $+2 (fetch) 

LDD [0,Uj 
STD 0,U 
NEXT 

i FDB $+2 (store) 

PULUX 
PULU D 
STD 0,X 
NEXT 

DUP FDBS+2 
LDD 0,U 
PSHUD 
NEXT 

OVER FDBS+2 
LDD 2,U 
PSHUD 
NEXT 

SWAP FDBS+2 
PULU D,X 
EXG D,X 
PSHU D,X 
NEXT 

DROP FDB $*2 
LEAU 2,U 
NEXT 

NEXT is defined as a macro instruction. 



Table. Coding and performance 
analysis of the Forth NEXT operation 
for popular eight-bit micro processors. 

Processor 6809 6800 



Postfix Infix 

34+56+ x (3+4)x(5+6) 



8088 6502 



Memory bytes 
Processorclock 
cycles 

Normal cycle 
time 1ns) 

Total time (us) 
Memory -access (ns) 
Timefor 450ns- 
access memory (u‘ 
Speed relative 
to 6800* 



LDX0.Y-- JMP NEXT JMP NEXT 
JMP [0,X] LDXIP LDAXB 
INX INXB 
1NX MOV L,A 

STXIP LDAXB 
LDX 0,X INXB 
STXW MOV H,A 
LDX0.X MOV E,M 
JMP 0,X INX H 

MOV D,M 

XCHG 

PCHL 



JMP NEXT 
LODS AX 
MOV BX,AX 
MOV DX,BX 
INC DX 

JMP WORD PTR1BX1 



JMP NEXT 
LDY #1 
LDY [IP1,Y 
STAW+1 
DEY 

LDY [IP],Y 

STAW 

CLC 

LDAIP 

ADC #2 

STAIP 

BCCL 

INCIP+1 

LJMPW-1 



4 17 



28 



14 44 60 58 



43 



14 44 

695 530 

9 37 



0.25 0.2 

15 11.6 

250IZ80) 450 

27 11.6 

1.37 3.19 



43 

650 

29.7 

1.25 



•Value rises proportional to speed. 



Parameters are also passed between 
separate lists using the stack. The word 
consumes as many stack elements as re- 
quired and pushes back its results. Some 
defined Forth words for subtracting and 
doubling the top of the stack respectively 



‘‘-’’FDB DOCOL “2*”FDB DOCOL 
FDB MINUS FDB DUP 

FDB ADD FDB PLUS 

FDB SEMIS FDB SEMIS. 



Language control structures 

As has been shown, Forth passes control 
from one item in a word to the next and 
results are calculated. These words can be 
either machine-code words or pointers to 
other words. How control may be diverted 
to form if-then-else or repeat-unt.il .sta- 
tures is the following subject, starting with 
an explanation of how Forth tests for true 
or false conditions by simply considering a 
non-zero value at the top of the data^stack 
as a true condition. Examples of conditions 
that create these flags are ‘0=’, *0<’» ‘=’ 
and '<’ in the form of code words or Forth 
words, as appropriate, Lists 4, 5. Diver- 
sion of control is carried out by Forth 
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List 4. Code routines leaving a flag at stack 

OEQUAL FDBS+2 

LDD#1 assume true (i.e. 
zero) 

LDX0.U + + get operand, set 



BEQ0E1 

DECB 

0E1 STD 0,U 

NEXT 

OLESS FDB $+2 
LDB #1 
LDA 0,U 
BMI0L1 
CLRB 

0L1 ; CLRA 

STD 0,U 
NEXT 



was <>0 so set 

Forth flag 

put back Forth flag 



prepare true 
get sign to A 

no, leave false 



List 5. Forth routines leaving a flag. 
FDBDOCOL 
FDB SUB 
FDB OEQUAL 
FDB SEMIS 
"<" FDB DOCOL 

FDB SUB 
FDBOLESS 
FDB SEMIS 
“>" FDBDOCOL 

FDB SWAP 
FDB LESS 
FDB SEMIS 



words BRANCH and OBRANCH, the 
former taking the next storage cell as a 
branch offset and the latter branching or 
not depending on the value at the top of 
the stack. If the flag is false, the threaded- 
code instruction pointer, ip, is incre- 
mented by the offset value contained in the 
next program storage cell. When the flag is 
true, this offset is skipped and execution 
continues with the next word. Controlled 
loops may also be constructed. Using ‘be- 
gin . . . until’ structures, statements be- 
tween are executed so long as the flag at 
the top of the stack remains false. Iterative 
loop type structures such as ‘100 TIMES 
DO’ are handled by taking initial and limit 
loop indexes off the data stack and storing 
them on the control stack. At the potential 
end of the loop the current index is incre- 
mented and compared with the limit. If 
the limit is exceeded a branch is executed 
as described above, otherwise the indexes 
are deleted and the offset skipped to con- 
tinue execution, List 6. 



List 6. Code for diverting control flow if the 
flag at the top of the stack is false. 
OBRANCH: FDBS+2 6809 code 

LOD ,U + + test and delete 
Forth flag 

BNE0B1 <>0, branch if true 
LDX 0.Y get jump offset in 



LEAYY.X add offset 
NEXT 

OBI: LEAY 2.Y skip over offset 

NEXT 

BRANCH FDBS+2 
LDXO.Y 
LEAYY.X 
NEXT 



: ROOTS ( stack ,.cb a start defining newword 'ROOTS' ) 

SWAP MINUS ( ..ca-b ) 

OVER ( ..ca-ba) 

DUP+ ( ..c a -b2a quickerthan 2* ) 

/ ( ..ca -b/2a ) 

ROT ROT ( ,.-b/2a c a save -b/2a } 

/ ( ..-b/2a c/a ) 

OVER DUP* ( ,.-b/2a c/a -b/2a*-b/2a ) 

+ ( ,.-b/2ab**2/4a-a/c ) 

DUP0< ( is top less than 0, ie imaginary roots? ) 

IF ( testflag ) 

DROP DROP ( delete partial results, send <crxlf> to terminal ) 

CR imaginary roots” ( and print message ) 

ELSE 

CR ( real roots, send <crcxlf> to terminal ) 

0 ( convert 16-bit positive numberto 32 bits ) 

SORT ( get back square root ) 

OVEROVER+ ( duplicate both parts of answer and get 1st result ) 

roots are” . ( print message and first answer ) 

,”,and”-. ( print message and other answer ) 

ENDIF ( continue execution } 

CR; ( send <crxlf> and stop compiling return to execution ) 



List 7. Forth code used to calculate the 
roots of a quadratic equation. The stack is 
represented across the page with the top of 
thestackatthe right. 

Using Forth 

List 7 is an example of a Forth routine for 
calculating the roots of a quadratic equa- 
tion, given that the indexes are on the 
stack. Forth has the shortcoming that it 
only handles integer arithmetic so non-in- 



teger results will be incorrect. The 
program example illustrates a number of 
Forth concepts, e.g., stack manipulation, 
passing parameters and terminal output. 
Words used in the program are explained 
in the next article, as are the dictionary and 
compiler. 

Reference 

7. Intel iAPX88 Book. Julv 1981, appendix pp. 
20-36. 



Three flow diagrams compare, from left to 
right, hard code, interpretive code and 
threaded code. 
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Forth language 

Forth consists of words and new words must be compiled and entered into its dictionary. 
Following a description of the dictionary and compiler, Brian Woodroffe discusses 
advanced concepts in this third article. 



Having shown how the address interpreter 
executes lists of addresses to execute 
program commands and that threaded 
code is compact, I will now explain how 
Forth builds these lists, i.e., how it com- 
piles. Each list representing an action is 
rather like a verb in a natural language and 
in Forth is called a WORD. The collection 
of these words, which is Forth, is known 
as the dictionary. The outermost WORD 
in Forth breaks input text down into char- 
acter strings which it then searches for in 
the dictionary. (Spaces are important, for 
instance, ‘-1’ is treated as a negative 
number, whereas T is treated as. the 
arithmetic subtraction operator followed by 
the positive number one.) If the string is 
found, it is executed, otherwise an error 
message is generated. The dictionary also 
needs a mechanism to allow the search to 
. occur. Searching involves a traverse of all 
the words in the dictionary. Each entry has 
a pointer to the previous one (link field), 

, which makes the dictionary a linked list. 
To enable matching of the source text each 
word also has its name in ASCII form 
(name field). Dictionary entries for each 
: word, List 1, have four fields — name 
' field, link field, code field and parameter 
; field. The name field also contains data for 
: use by the compiler (precedence and 
smudge bits) as will be explained, and it 
includes the length of the name to allow 
variable name lengths of up to 3 1 charac- 
; ters. 

For language expansion it is important 
to be able to build dictionary entries for 
new words. This is done by invoking the 
i compiler. When the compiler is invoked 
KForth word V), the language state is 
: switched from execution to compilation. 

; Next, input text is scanned forward for the 
■ next text string which is used to build a 
..newly created name field. The name is 
. ‘smudged’ so that during the building of 
: ; the incomplete definition, the same name 
r cannot be found. This normally prevents 
recursion, but again in Forth, this rule can 
. :be overcome, List 2. Then the linked list 
of the dictionary is updated by copying it 
• into the dictionary link and the address of 
JDOCOL is copied as this new word’s code 
.field, Next, input text is scanned for char- 
' acter strings. As these character strings are 
.matched with words that already exist in 
tthe dictionary, the code field of each word 
"found is copied into the parameter field of 
(the word being compiled. Finally, as the 
'•word to terminate compilation is encoun- 
tered SEMIS is copied as the last word 
'•'of the definition and the Forth program is 
■returned from the compile state to the 



by B. Woodroffe 



execution state. The compiled word is now 
‘unsmudged’ to allow it to be accessed. 

The compile process can be quite iong as 
many dictionary searches have to be made. 
As the dictionary is a linear list and the 



code routines which ultimately have to be 
compiled are at the bottom, it is a long 
search. No speed up algorithms such as 
hashing have been applied to standard FIG 
Forth though there has been experimenta- 
tion 8,9,10 . As so much work is done during 
the compile phase the execution perform- 
ance of newly defined words is nearly as 
quick as predefined words. Further, 
as the first half of the dictionary entry 



List 1. Each dictionary entry has four fields called name field, link field, code field and 
parameter field. 



LFA 

CFA 

PFA 



Dictionary entry 
76543210 
1 p s < 1 e n > 
0 a s c i i 1 

0 a s c i i 2 

0 

1 a s c i i n 



p=precedence, s=smudge, len = length of name 

ASCII characters of word 

d7 set on last character of name 
16 bit address of previous n.f.a. 

1 6 bit address of code routine 

parameters, normally other c.f.as 



List 2. Example of recursion In Forth to calculate a factorial. 

First define 
: MYSELF 

LATEST ( put address of word currently being defined on stack ) 

PFACFA, ( convertto code-field address and compile ) 

( it in dictionary so that it may call itself ) 

; IMMEDIATE ( as thi& word is to execute when in the compile state it has 

'precedence'. ) 



Then use myself in the recursive definition 



FACTORIAL ( i 
DUP 1 = IF ELSE 
DUP 1 - 
MYSELF 

ENDIF 



( end of recursion?, yes leave 1 as 1 1 = 1 ) 
( n, n-1 . . . ) 

( call myself to calculate n-1 ! ) 

( n!; = n*[n-1l] ) 



List 3. Definitions of IF, ENDIF and ELSE. 
: IF 

COMPILE OBRANCH 
HERE 
0 , 

; IMMEDIATE 

: ENDIF 
HERE 
OVER - 
SWAP ! 

; IMMEDIATE 

: ELSE 

COMPILE BRANCH 
HEREO, 

SWAP 

[COMPILE] ENDIF 
; IMMEDIATE 



{ compile into the dictionary the c.f.a. of 'OBRANCH' ) 

( place on stack where we are ) 

( make space for jump offset ) 

I make compiler execute this word, even if in compile state ] 
( where are we? ) 

( calculate offset to HERE executed in IF ) 

( patch in offset to address left by IF ) 



{ compile run time address of routine to skip false statements ) 

{ make space for jump offset ) 

{ get address of where IF was ) 

( use ENDIF to fix address, ENDIF is immediate and to overwrite 
that such that it is compiled ) 
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List 4. Examples of VARIABLE and CONSTANT. 



: VARIABLE 
<BUILDS 

DOES> 

: CONSTANT 
<BUILDS , 
DOES> @ 



( variable is a new parent word ) 

( store in the p.f.a. the value that was top of stack ) 

( start defining what offspring will do } 

( nothing - p.f.a. is storage location for an offspring of type VARIABLE ) 



( constants provide a constant value which has been ) 
{ stored in their p.f.a. ) 



0 VARIABLE ABC ( ABC is an offspring with initial value 0 ) 
1000 CONSTANT K ( K is an offspring of value 1000 ) 



(name and link fields) is only required 
during compilation for fixed applications 
where compilation is not required, these 
fields can be deleted. This dramatically 
reduces the memory requirements of the 
Forth system 11 , and can be especially use- 
ful when the code will be placed in rom. 

To enable the compiler action of Forth 
to do more than just that described above 
certain words need to execute even when 
the language is in the compile state. This 
gives the compiler the full capabilities of 
Forth. These words are generally involved 
in building control structures for the com- 
piler (IF-ELSE-ENDIF, see List 3). 
These words have a precedence bit set 
which the compiler recognises when it 
matches the input text so instead of com- 
piling its code field it executes it. In the 
case of IF the compiler compiles into the 
dictionary the c.f.a. (code-field address) of 
0BRANCH and advances the dictionary 
pointer to allow the as yet unknown offset 
to be placed. It also pushes this address 
onto the stack so that when the compiler 
encounters an ELSE or ENDIF statement 
it can calculate the offset back to the IF 
statement and store the offset there. This 
shows the power of Forth in that the com- 
putational ability of the language is avail- 
able to the compiler and to the user. 
Further there are times when words that 
would normally execute (i.e. have prece- 
dence) need to be compiled (i.e. execution 
action delayed until the word currently 
being defined executes). This is done using 
the word [COMPILE]. Again, it is 
sometimes required to delay compilation 
of a word until the word that contains it 
executes. This is done using the word 
COMPILE. 

Advanced concepts 

With an idea of how the inner interpreter 
(address interpreter) and the compiler 
(text interpreter) work we can now move 
on to advanced concepts including exten- 
sion, vocabularies and virtual memory. 
Forth is either in the compile state, when it 
finds words and copies their code-field 
addresses into the dictionary to form new 
entries, or in the execute state, when it 
executes each code-field address encoun- 
tered. I have shown how certain immediate 
words can override the state, and can even 
execute in the compile state. It is also pos- 
sible to overrule words which are declared 
as immediate and compile them, as in the 
case of ELSE which was described earlier. 

In Forth, even the compiler can be mod- 
ified. Not only can new compiler control 
structures be introduced but also new 



compiler words may be defined. Normally 
the programmer would have to rewrite the 
compiler but with this feature, known as 
extensibility, modification is relatively 
simple. It involves use of the words <B- 
UILDS and DOES> to define a new class 
of words. The defining word defines words 
of this new class. Behaviour of the new 
defining word is determined by the words 
between <BUILDS and DOES>, i.e. 
when a word of the new class is defined, 
behaviour of the new word during compila- 
tion is determined by what comes between 
<BUILDS and DOES>. When a word of 
this new class executes, it executes the 
words following DOES>. To allow the 
parent class-defining word to access its off- 
spring'(class-defining word to access its off- 
spring (class-defined word), the parameter- 
field address (p.f.a.) of the latter is placed on 
the data stack. Two simple examples from 
the Forth compiler are VARIABLE and 
CONSTANT, List 4. These can easily be 
expanced to form arrays and tables. The 
word V is also a defining type. When off- 
spring of are executed they call the word 
DOCOL which decides how to execute their 
parameter field. An alternative 
to DOES> is used to define V; the assem- 
bler is invoked so that the parent-word 
execution field is machine code and not 
Forth but in other respects it is the same. 

The major part of Forth is the dictio- 
nary, and to enable different problems to 
be solved in different areas of the dictio- 
nary each problem is given its own voca- 
bulary. The dictionary may have many 
vocabularies alongside the normal basic set 
of FORTH, ASSEMBLER and EDITOR. 

1 Using vocabularies means that the same 
word may have different meanings, de- 
pending on which vocabulary is active. 
FIG-Forth has two active vocabularies - 
CURRENT and CONTEXT. The former 
is the one in which words are defined and 
the latter is the one which is searched first. 
All vocabularies are linked to FORTH 
(Forth’s definition in Forth). Much debate 
is taking place on the subject of vocabula- 
ries concerning the subject of searching 
vocabularies 11 " 13 . 

Virtual memory 

Memory is the most precious resource of a 
computer and although Forth makes very 
efficient use of it, there are still times when 
programmers wish it was infinite. Disc 
memory is much cheaper than semicon- 
ductor memory but it is also slower. By the 
concept of virtual memory, the memory 
space available to the programmer is ex- 
panded beyond the main memory to in- 



Forth words used in last 
month s program 
example 



DUP I i» ' U * ’i S* i ' 'O 

SWAP ’I" c! jfl'l ‘AJ ■ I» ■"* 'Its .. 
OVER m. ? i op* f 'ii n - o 

ROT '«“s *hii« ■' *r 2 d 

Arithmetic operators 

MINUS * ' « » s *h <* ipvirn c merit 

: 

U i ,ir»,L t' .re h 'i •. p -arr 






Control operator* 

IF t .• i n 1“ rfi • iM<F 

■ ll * Ml ) s«i, st \iF (i END “ 

ELSE r » k*“ t * ' l i it i fc TR^F 

ENDIF « i i «nl r il I,', j 

TeiininHl operator* 1 

CH 'ii« i r ri hm r d In » 



elude disc storage so memory capacity as ; 
far as the programmer is concerned is only 
limited by the capacity of the disc. Ini . 
Forth, the virtual-memory cancept is only; 
applied for data whereas in most processor 
applications (e.g INS16000 series) it is also 
applied for program storage. Through use.; 
of the word BLOCK, the programmer can, l 
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List 5. Representation of algorithm used for benchmark test, see Table 1. 



visualise the disc memory as processor 
memory. The Forth operating system re- 
covers data from disc and places it in a 
buffer to make it accessible to the user 
program. Many such buffers exist in the 
host processor and BLOCK uses an algo- 
rithm that determines whoch blocks 
should be maintained in the store and 
which should be written back to the disc. 
With the right algorithm the number of 
disc accesses will be minimal and the 
apparent memory-access time low. 



Space and time 

I have shown how the Forth dictionary is 
created (i.e. its form), how it may be ex- 
tended by compiling and how any proces- 
sor may readily simulate the virtual Forth 
machine by means of indirect threaded 
code. As mentioned earlier, by introducing 
the concept of threaded code, execution 
speed is traded for code space. So one can 
expect that Forth is not as fast as the host 
processor’s own code although it may ap- 
proach it where many subroutine calls are 
made. Execution of a process defined in 
the source language divides into two parts; 
one is the examination of source 'text to 
find out what action is to be taken and the 
second is execution of the actions by the 
processor. In most systems the first part is 
carried out by compiling source text in the 
machine code of the host machine. In 
Forth this means compiled into the thread, 
the machine code of the hypothetical 
machine. Target machine code is then run 
in Forth using the address interpreter. 
Running time can be traded for space by 
choosing an intermediate language of suit- 
able complexity. Running time perform- 
ance of compilation has no effect on the 



CRAY-1 Fortran 

68000 assembly language (8MHz) 

PDP11/70C 

VAX 11/780 (C/Fortran/Pascal) 

8088 assembly language (15MHz) 

6809 assembly language 

PDP11/40C language 

Z80 assembly language (4MHz) 

6809 IMS Pascal compiled (2MHz) 

PDP11/70 Decus Forth 

Z80 Microsoft Basic compiler 

8088 Pascal {Softech compiler) (15MHz) 

68000 Forth (8MHz) 

6809 FIG Forth (2MHz) 

8088 FIG Forth (15MHz) 

8086FIG Forth (12.5MHz) 

6809 FIG Forth <1.5MHz)* 

Z80 Forth (Timin) (4MHz) 

Z80 Forth (Laboratory Microsystems) (4MHz) 
Z80 FIG Forth (4MHz) 

6809 IMS Pascal P-code (2MHz) 

6809 Basic 09 (2MHz) 

6502 FIG Forth (1MHz) 

6809 TSC Basic (2MHz) 

Z80 Microsoft Basic 
APPLE integer Basic 
TRS80 Microsoft Basic . 

6809 Computerware Basic 



8190 CONSTANT SIZE 
0 VARIABLE FLAGS SIZE ALLOT 
: DO-PRIME 
FLAGS SIZE 1 FILL 
0 

SIZE 0 DO 
FLAGS I + C@ 

IF 

IDUP + 3 + DUPI + 

BEGIN 
DUP SIZE < 

WHILE 

0 OVER FLAGS + C! 

OVER + 

REPEAT 
DROP DROP 1 + 

THEN 

LOOP 

. primes” ; 



running time of the application program, 
which leads to the view that the compiler 
should do as much work as possible. Un- 
fortunately, compiling to machine code 
using a simple processor with limited 
addressing capabiiityj such as a current 8- 
bit microprocessor, often results in the 
code not fitting into the memory so an 
intermediate target code is chosen, with 
the accompanying penalty of interpreting 
it. Forth’s address interpreter costs some 
tens of percent. 

Other losses occur because microproces- 
sors are not zero-address devices so the 
zero-address function has to be simulated. 
Memory-space benefits are illustrated by 
the amount of memory required for a 
Forth system, which is typically 8Kbyte 
(may be rom) for virtual-machine simu- 
lation, the Forth compiler, i/o drivers, 
etc., and 8K for stacks, virtual-memory 
buffers and the user dictionary. 



1. 5-5.0 

4.0 

5.1 

6.1 
6.8 
8.9 

11.8 

18.6 

19.4 

27 

45 

55 

64 

67 

75 

78 

85 

105 

238 

287 

830 

1920 

2320 

2250 

4303 



( allocateB191 bytes foran array ) 

( fill array with T, <true> ) 

( counter ) 

I set up a DO loop of 8190 times ) 

( I is loop counter, get relevant flag ) 

( C@,C! are byte versions of @,l ) 

( stack is . . . count, prime, K ) 

( begin a block ) 

( array index < size ? ) 

( test flag, to see if exit block ) 

( set relevant flag false, FL^GS[K] ) 

( K:=k+prime ) 

( end of block, loop back ) 

( delete prime, K; one extra primefound } 
( end of IF ) 



Forth is also fast because of the explicit 
use of the stack. In languages using the 
assignment operator, data normally resides 
outside the stack. It is brought to the 
stack, operated on, and finally placed back 
into the store. If the next statement uses 
the same variable it is once again taken 
from the store and placed on the stack. 
When computing partial results this causes 
excess memory traffic. Unless optimiza- 
tion is used this redundant memory activ- 
ity will cause delays. Forth avoids this 
because normally data only resides on the 
stack. No unnecessary memory space or 
time is taken up by temporary variables. 

It is interesting to compare Forth’s per- 
formance with the commonly used lan- 
guage for microprocessors, Basic. Systems 
using Basic have little compiler action, the 
source text being saved in memory, al- 
though the key words are converted into, 
internal tokens. During program execu- 
tion, each token is parsed and acted upon 
in turn so the source of Basic’s execution- 
time interpreter is close to that of ttie 
source text whereas Forth’s source for the 
running-time interpreter is close to the 
language of the host computer. As all the 
work in a Basic system is done while the ’ 
program is running the speed penalty is 
high, usually at hundreds of percent. 
Further, since Forth compresses object 
code into 16-bit addresses (code-field 
addresses are the equivalent of tokens) it is 
as efficient as Basic in terms of memory 
space. 

Processing speed is an emotive issue 
without benchmark tests and unbiased 
benchmarks are notoriously difficult to 
produce. Table 1 was derived using the 
Seive of Eratosthenes (see List 5) and 
seems fair 14 . Qualitatively, it confirms 
what one could expect - assembly-lan- 
guage is faster, followed by compiled lan- 
guages with interpreted Basic well behind. 
The table also shows how well the 6809 
compares with newer and more popular 
designs and that it compares with at least 
one 16-bit device, the 8086. I would^ttri- 
bute this to the instruction set as was 
shown in the analysis of Forth word 
NEXT. A more elaborate, special-purpose 
instruction set does not necessarily lead to' 
a more effective processor. This has been 
shown in recent research into reduced in- 
struction set computers. 

continued on page 61 



* Used in my design as described in Wireless World 



Table 1 . Relative speeds of various processor and languages. 



Processor/language Time in 

seconds 



.11 

1.12 

1.52 



( end of DO . . . LOOP block ) 
( print number of primes ) 
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List 6. Array boundary checking using <BUILD . 
: ARRAY 
<BUILDS 

OVER -SWAP OVER 
SWAP,, 

DUP + ALLOT 
DOES> 

DUP ROT 

SWAP DUP 0< 

IF array bound error, too low" QUIT THEN 
OVER 2+ @ 

OVER < 

IF array bound error, too high" QUIT THEN 
DUP + 

+ 4 + 



Forth problems 

So far, only advantages of Forth have been 
discussed but it has some disadvantages. 
The most obvious of these is notation. For 
the beginner, reverse-Polish notation and 
the lack of an assignment operator (:=) are 
considerable problems. Practice lessens 
the problems though program comments 
and stack diagrams generally remain 
necessary to show what is going on. 

Floating-point arithmetic is not stan- 
dard and all data manipulation assumes 16- 
bit two’s-complement arithmetic, but it 
may be programmed in 15 . This shows 
Forth’s origin in the control field of com- 
puting. However, many Forth pro- 
grammers maintain that most problems 
can be reduced to scaled-integer arith- 
metic. This drawback makes one aware of 
the processing cost of floating-point arith- 
metic. Forth does not use ‘data typing’. 
This means that integer operations are 
used when logical operations are being per- 
formed (‘0=’ for NOT). There are also 
separate operators for 32-bit arithmetic. 
Computer languages can usually apply dif- 



. .DOES>. 

( low high . . assumes low chigh ) 

{ delta low delta ) 

{ store low is p.f.a., delta as p.f.a. +1 ) 

( that much storage, byte address machine ) 

( . . p.f.a. p.f.a. index ) 

( . . p.f.a. required-delta flag ) 

( . . p.f.a. reqd allowed I 
( . .p.f.a. reqd flag } 

{ word index to byte index ) 

{ add index-skip parameters, leaving array 
address ) ’ 



ferent operations for the same operator by 
data typing. 

A more serious drawback is the lack of 
built-in data structures - not that Forth is 
any worse in this respect than Basic or 
Fortran. What is lacking are the type of 
data structures available in Pascal. In com- 
mon with the formerly mentioned lan- 
guages, Forth lacks a method of checking 
for overflow and array boundary condi- 
tions in normal operation. But as shown in 
List 6. This can be programmed in. 
Naturally, this process increased execution 
time but when the application works a 
simpler version of array can be coded by 
missing out the check. Finally there is as 
yet no file management software. Access to 
disc information has to be done using 
BLOCK numbers. 

Summary 

I have shown that the programmer is re- 
leased from the instruction set of the host 
computer with little time penalty by apply- 
ing threaded code. Using the compiler, 
one may easily extend the Forth instruc- 



tion set to suit one’s own application. As 
the whole dictionary is available all of the 
time (ranging from virtual-machine in- 
structions to <BUILDS . . . DOES> 
structures) the programmer can tackle low 
or high-level problems, such as i/o driving 
or word processing, with equal ease and 
efficiency. The consistent nature of the 
compiler and text interpreter allow easy 
interactive testing of code before it is com- 
piled. Reverse-Polish notation simplifies- 
the compilation process and allows it to be 
completed in one pass in a small memory. 
Virtual memory and vocabularies further 
enhance Forth by offering infinite data 
space and better control of the application 
software respectively. However, shortcom- 
ings of the language may prevent it from 
being applied to larger computers where 
its space-saving features are less useful. 
But it will continue to find many applica- 
tions in small and interactive systems and 
real-time applications including hardware 
simulation, video games and test-equip- 
ment control. 
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